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an d AE3 . the vector contains the cis-aciing packaging sequence of the wild type 
adenovirus genome, and a gene which encodes an HIV Pol protein (SEP ID NO: I) 
inserted in El . 



The claim is directed to an adenoviral vector that is at least partially deleted of AEl^ 
andAE3 , the vector contains the cis-acting packaging sequence of the wild type 
adenovirus genome, and a gene v*dch encodes an HIV Pol protein (SEQ ID NO: 5) 
inserted in El. 



The claim is directed to an adenoviral vector that is al least partially deleted of AEl. 
andAK . the vector coQtams ihe cis-acting packaging sequence of the wild type 
adeno^drus genome . and a gene v*iich encodes an HIV Pol protein (SEQ ID NO: 7) 

inserted in El . 

The claims are directed to a method of making and harvesting of a recombinant 
adenoviral particle that contains a gene encoding an HIV Pol protein. 



The claim is directed to a method of generating a cellular mediated innoine response 
to Ht V Pol protein with the recombinant adenoviral particle. 
The claim is directed to a method of generating a cellular nwdiated immune response 
to HIV Pol protein with the recombinant adenoviral particle in addition to 
administering a DNA plasmid vaccine. 



The claims are directed to an adenowal vector that is at least partially deleted of 
AEl , the vector contains the cis-acting packaging sequence of the wild type 
idSovirus genome, and a gene winch encodes an HIV Nef prote in (SEO ID NO: 9) 
inserted in the parallel orientation of EL 



The claims are directed to an adeno>aral vector that is at least partially deleted of 
AEl the vector contains the cis-acting packaging sequence of the wild type 
IdCTOvirus genome, and a gene which encodes an HIV Ncf protei n (SEQ ID NO: 1 1) 

inserted in the parallel orientation of El . — 

The claims are directed to an adenoviral vector that is at least partially deleted of 
AEl the vector contains the cis-acting packaging sequence of the wild type 
Sfflttovirus genome . and a gene which encodes an HIV Nef prolem ^SEQ ID NO: 13) 
inserted in the parallel orientation of El . 
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The claims are directed to an adenoviral vector that is at least partially deleted of 
AE 1 . the vector concains the cis-acting packaging sequence of the wild type 
SSovirus genome, and a gene wWch encodes an HIV Nef protei n (SEQ ID NO: 15) 

inserted in the parallel orientation of El. 

Tte claim is directed to an adenoviral vector that is at least partially deleted of AEl , 
the vector contains the cis-acting packaging sequence of the wild type adenovirus 
genome, and a gene wtoch encodes an HIV Nef protein (SEQ ID NO: 9) inserted in 

the antiparallel orientation of El. , - 

The daim is directed to an adenoviral vector that is at least partially deleted of AEl. 
the vector contains the cis-acting packaging sequence of the wUd type adenovirus 
genome, and a gene which encodes an HIV Nef protein (SEQ ID NO: 11) inserted in 

the antiparallel orientation of El. -- 

The claim is directed to an adenoviral vector that is at least partially deleted of AEl,. 
the vector contains the cis-acting packa^g sequence of the wUd type adenovirus 
genome, and a gene which encodes an HIV Nef protein (SEQ ID NO: 13) inserted in 

the antiparallel orientation of El . . 

The claim is (firected to an adenoviral vector that is at least partially deleted of AEl. 
the vector contains the cis-acting packaging sequence of the wUd type adenovirus 
genome, and a gene which encodes an HIV Nef protein (SEQ ID NO: 15) inserted in 
the antiparalld orientation of El 



The claim is directed to an adenoviral vector that is at least partially deleted of AEl 
andA£3 . the vector contains ihe cis-acting packaging sequence of the wild type 
adenovirus genome, and a gene which encodes an HIV Nef protei n (SEQ ID NO: 9) 
inserted in El 



The claim is directed to an adenoviral vector that is at least partially deleted of AEl, 
andAE3 . the vector contains the cis-acting packaging sequence of the wild type 
adenovirus genome, and a gene which encodes an HIV Nef protei n (SEQ ID NO: 1 1) 
inserted in El 



The claim is directed to an adenoviral vector that is at least partially deleted of AEl 

and AE3 . the vector contains the cis-acting packaginjz sequence ol' the wild type 
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The claim is directed to an adenoviral vector thai is at least partifllly deleted of At], 
and AE3 . the vector contains the cis-acting packaging sequence of the wild type 
jjpn^.^-,- ^^^rs^ f. t»hieh encodes an fflV Pol protem (SEO ID NO: 5) 

inserted in E 1 . „____««____^ 




"55 


The daim is directed to an adenoviral vector that is at least partially deleted of AEl 
and AE3 . the vector coniams the cis-acting packaging sequence of the wOd type 
P^.p^.««,<. 2*'""'^ * B*"* encodes an HIV Pol protein (SEO ID NO: 7) 
inserted in E 1 . - __ 
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The claims are cfirected to a method of making and harvesting of a recombinant 
adenoviral particle that contains a gene encoding an HIV Pol protein. 
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Tl» claim is directed to a method of generating a cellular mediated immune response 
to HIV Pol protein with the recombinant adenoviral particle. 
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The claim is directed to a method of generating a cellulax mediated immune re^nse 

fflv pfti prnf«»in with the recombinant adenoviral oarticle in addidon to 
administering a DN A plasmid vaccine. 
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The claims are directed to an adenoviral vector that is at least partially deleted of 
AEl, the vector contains the cis-acting packaging sequence of the wild type 
TAl^^^r.xK ^pnnnwt and a pene vMch encodes an HIV Nef protein (SEO ID NO: 9) 
inserted in the parallel orientation of El . 
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The claims are directed to an adenoviral vector that is at least partially deleted of 
AEl, the vector contains the cis-acting packaging sequence of the wild type 
a/i»n«v,'r»ic pAonme and a cene which encodes an HIV Nef protein (SEO ID NO: 1 1) 
inserted in the parallel orientation of El . 
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The claims are directed to an adenoviral vector that is at least partially deleted of 
AE 1 , the vector contains the cis-acting packaging sequence of the wild type 
fldftnnvirus genome, and a gene wtoich encodes an HIV Nef protem (SEQ ID NO: 15) 
^^Rerted in the parallel orieniation of E 1 . 
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The claim is directed to an adenoviral vector that is at least partially deleted of AEl. 
the vector contauis the cis-acting packaging sequence of the wild type adenovirus 
f^moiTw Md a ^ene which encodes an HIV Nef protein (SEO ID NO: 9) inserted in 
the antiparallel orientation of El . 
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The claim is directed to an adenoviral vector that is at least partially deleted ot AEl. 
the vector contains the cis-acting packaging sequence of the wild type adenovirus 
£^nnmf» und a gene which encodes an HIV Nef protein (SEO ID NO: 1 1) inserted in 
the antiparallel orientation of El. 
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The clahn is directed to an adenoviral vector that is at least partially deleted of AEl, 
the vector contains the cis-acting packaging sequence of the wild tj'pe adenovirus 
£PnftTne and a pene wAdch encodes an HIV Nef protein (SEQ ID NO: 1 3) inserted in 
the antiparallel orientation of El. 
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The claim is directed to an adenoviral vector that is at least partially deleted of AEl. 
the vector contains the cis-acting packaging sequence of the wild type adenovirus 
f pnnme. and a pene which encodes an HIV Nef protein (SEQ ID NO: 1 5) inserted in 
the antiparallel orientation of £1. 
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The claim is directed to an adenoviral vector that is at least partially deleted ot Afcl 
and AE3 , the vector contains the cis-acting packaging sequence of the wild type 
nrfMM^vinw penome. and a eene vtoch encodes an HIV Nef protein (SEQ ID NO: 9) 
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The claim is directed to an adenoviral vector thai is at least partially deleted of AEl 
and AE3 , the vector contains the cis-acting packaging sequence of the wild type 
«Hminvirus penome. and a jzene which encodes an HIV Nef protein (SEQ ID NO: 1 1) 
inserted in E 1 . 
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The clahn is directed to an adenoviral vector that is at least partially deleted of AEl 
and AE3 . the vector contains the cis-acting packaging set|iJcnce of the wild type 
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TS^;;^hnonK,z^d^&i^ which encodes an HIV Nef proie in (SEQJDNOUj l 

^^"rdirecied to an adenovnal vecior that is at least partially deleted of Mi 
L^IIT le Zor contains the cis-acting packaging sequence of ^hejald^J^ 

genome, and a gene encodes an HW Nef pr otein (SEQ IDNOLLjl 

inserted in El . 

The claims ure directed to a method of making and harvestii-g «t a recombinant 

Xwviral n.r .icle ihat comain » a gene encoding «° HIV Nef protein 

The cJaims aw directed to a method of generating a cellular mediated immune 

res>onse to HIV Nef with the recombinant adenoviral parlicle. 

The daims are directed to a method of generaUng a cellular mediated imimne 
tespoose to HIV Nef with the recombinant adenoviral parUcle maddiuoiito 

.rtministerine a P NA plasmid vaccine. i A.^^^rmesstd 

The claim is drawi to a multivalent vaccine wherein gag. pol ano «e/are expreised 

fmm three individual vectors. , 

ThTclaims are drawn to a multivalent va ccine ^^in gag. pol ana nej are expressed 

from one individual vectors. 



from one maiviautu vcutma. ■ . 

"The claims arc dravm to a mulUvalent vaccine therein gag, pot ana my 
fZa t«o individual vectors, one expressing nef^pol and one expressmg gag. 
Tte claims are draw to a fflultivia«at vaccine v4>erein gag. pol and »e/are trpn^s^i 
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STdaims are drawn to a multivatol vac cine ^rfierein gag. pol mi nef ^ expressed 

from a rinpiK veemrs as a ftision protein. ^ — — - 

The daims are drawn to a multivalent vaccine vtoerein gag mi pol are expressed 
frnm two individual vectors. 



from two in mviuuai ; 

llje claims are drawn to a niultivaleni vaccine wherein gag and pol arc expressed 
ir>r4tviAially from one vector. 



intiiviQuaiiv iiuiuiv"^ Tw>»».>i.. — ' . 

The claims are drawn to a multivalent vaccine >A4ierein poi and'ne/ are expressed 

frnm twD individual vec tors. . — _ — — -r- 

The claims are drawn to a multivalent vaccine wherein pot and nef are expressed 
from individually from one vector. 
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(57) Abstract: First generation adenoviral vectors and associated recombinant adenoviius-based HTV vaccines which show en> 
hanced stability and growth properties and greater cellular-mediated immunity are described within this specification. These ade- 
novira] vectors are utilized to generate and produce through cell culture various adenoviral-based HIV-1 vaccines which contain 
HTV-l gag, HIV-1 pol and/or HIV-1 nef polynucleotide pharmaceutical products, and biologically relevant modifications thereof. 
These adenovirus vaccines, when directly introduced into living vertebrate tissue, preferably a manmialian host such as a human or a 
non-human mamma] of commercial or domestic veterinary importance, express the HTV 1- Gag, Pol and/or Nef protein or biologically 
modification thereof, inducing a cellular inmiune response which specifically recognizes HIV-1. The exemplified polynucleotides 
of the present invention are synthetic DNA molecules encoding HIV>1 Gag, encoding codon optimized HTV-l Pol, derivatives of 
optimized HTV-l Pol (including constructs wherein protease, reverse transcriptase, RNAse H and integrase activity of HTV-l Pol 
is inactivated), HTV-l Nef and derivatives of optimized HTV-l Nef, including nef mutants which effect wild type characteristics of 
Nef, such as myristylation and down regulation of host CD4. The adenoviral vaccines of the present invention, when administered 
alone or in a combined modality regime, will offer a prophylactic advantage to previously uninfected individuals and/or provide 
a therapeutic effect by reducing viral load levels within an infected individual, thus prolonging the asymptomatic phase of HIV-1 
infection. 
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THIJE OF THE INVENTION 

ENHANCED HRST GENERATION ADENOVIRUS VACCINES EXPRESSING 
CODON OPTIMIZED HTVl-GAG, POL, NEF AND MODMCATIONS 

CROSS-REFERENCE TO RELAIED APPUCATIONS 

This application claims the benefit, under 35 U.S.C. §119(e), of U.S. 
provisional applications 60/233,180, 60/279,056, and Attorney Docket 20867PV2 
(serial number unassigned), filed September 15, 2000, March 27, 2001, and 
September 7, 2001 , respectively. 

STATEMENT REGARDING FEDERALLY-SPONSORED R&D 
Not Applicable 

REFERENCE TO MICROFICHE APPENDIX 
15 Not Applicable 

FIELD OF THE INVENTION 

The present invention relates to recombinant, replication-deficient first 
generation adenovirus vaccines found to exhibit enhanced growth properties and 

20 greater cellular-mediated immunity as compared to other replication-deficient vectors. 
The invention also relates to the associated first generation adenoviral vectors 
described herein, which, through the incorporation of additional 5' adenovirus 
sequence, enhance large scale production efficiency of the recombinant, replication- 
defective adenovirus described herein. Another aspect of the instant invention is the 

25 surprising discovery that the intron A portion of the human cytomegalovirus (hCMV) 
promoter constitutes a region of instability in adenoviral vector constructs. Removal 
of this region from adenoviral expression constructs results in greatly improved vector 
stability. Therefore, improved vectors expressing a transgene under the control of an 
intron A-deleted CMV promoter constitute a further aspect of this invention. These 

30 adenoviral vectors are useful for generating recombinant adenovirus vaccines against 
human immunodeficiency virus (HIV). In particular, the first generation adenovirus 
vectors disclosed herein are utilized to construct and generate adenovirus-based HIV- 
1 vaccines which contain HIV-1 Gag, HTV-l Pol and/or IDV-l Nef polynucleotide 
pharmaceutical products, and biologically active modifications thereof. Host 

35 administration of the recombinant, replication-deficient adenovirus vaccines described 
herein results in expression of HIV- 1 Gag, HTV-l- Pol and/or Nef protein or 

1 
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immunologically relevant modifications thereof, inducing a cellular immune response 
which specifically recognizes HIV-1. The exemplified polynucleotides of the present 
invention are synthetic DNA molecules encoding codon optimized HIV-1 Gag, HIV-1 
Pol, derivatives of optimized HIV-1 Pol Cncluding constructs wherein protease, 

5 reverse transcriptase, RNAse H and integrase activity of HIV-1 Pol is inactivated), 
HIV-1 Nef, and derivatives of optimized HIV-1 Nef, including nef mutants which 
effect wild type characteristics of Nef, such as myristyMon and down regulation of 
host CD4. The HIV adenovirus vaccines of the present invention, when administered 
alone or in a combined modality and/or prime/boost regimen, wiU offer a prophylactic 

10 advantage to previously uninfected individuals and/or provide a therapeutic effect by 
reducing viral load levels vwthin an infected individual, thus prolonging ttie 
asymptomatic phase of HIV-1 infection. 

BACKGROUND OF THE INVENTION 

15 Human Immunodeficiency Virus-l (HHV-l) is the etiological agent of 

acquired human immune deficiency syndrome (AIDS) and related disorders. KV-l 
. is an RNA virus of the Retroviridae family and exhibits the 5'l.TR-gag-pol-env- 
LTR 3' organization of aU retroviruses. The integrated form of HIV-1, known as the 
provirus, is approximately 9.8 Kb in length. Each end of the viral genome contains 

20 flanking sequences known as long terminal repeats (LTRs). The HTV genes encode at 
least nine proteins and are divided into three classes; the major structural proteins 
(Gag, Pol, and Env). the regulatory proteins (Tat and Rev); and the accessory proteins 

(Vpu, Vpr, Vif and Nef). 

The gag gene encodes a 55-kilodalton (kDa) precursor protein (p55) which is 
25 expressed fiom the unspHced viral mRNA and is proteolytically processed by the HIV 

protease, a product of the pol gene. The mature p55 protein products are pl7 

(matrix), p24 (capsid), p9 (nucleocapsid) and p6. 

The pol gene encodes proteins necessary for virus repUcation; a reverse 

transcriptase, a protease, integrase and RNAse R. These viral proteins are expressed 
30 as a Gag-Pol fusion protein, a 160 kDa precursor protein which is generated via a 

ribosomal frame shifting. The viral encoded protease proteolytically cleaves the Pol 

polypeptide away from the Gag-Pol fusion and further cleaves the Pol polypeptide to 

the mature proteins which provide protease (Pro, PIG), reverse transcriptase (RT, 

P50). integrase (IN, p3 1) and RNAse H (RNAse, pl5) activities. 
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The ne/gene encodes an early accessory HIV protein (Nef) which has been 
shown to possess several activities such as down regulating CD4 expression, 
disturbing T-cell activation and stimulating HIV infectivity. 

The env gene encodes the viral envelope glycoprotein that is translated as a 
5 160-kilodalton (kDa) precursor (gpl60) and then cleaved by a cellular protease to 
yield the external 120-kDa envelope glycoprotein (gpl20) and the transmembrane 41- 
kDa envelope glycoprotein (gp41). Gpl20 and gp41 remain associated and are 
displayedxjn the viral particles and the surface of HIV-infected cells. 

The tat gene encodes a long form and a short form of the Tat protein, a RNA 
10 binding protein which is a transcriptional transactivator essential for HIV-1 
replication. 

The rev gene encodes the 13 li)a Rev protein, a RNA binding protein. Hie 
Rev protein binds to a region of the viral RNA termed the Rev response element 
(RRE). The Rev protein promotes transfer of unspliced viral RNA from the nucleus 
15 to the cytoplasm. The Rev protein is required for HIV late gene expression and in 

turn, HIV replication. 

Gpl20 binds to the CD4/chemokine receptor present on the surface of helper 
T-lymphocytes, macrophages and other target cells in addition to other co-receptor 
molecules. X4 (macrophage tropic) virus show tropism for CD4/CXCR4 complexes 

20 while a R5 (T-cell line tropic) virus interacts with a CD4/CCR5 receptor complex. 
After gpl20 binds to CD4, gp41 mediates the fusion event responsible for virus entry. 
The virus fuses with and enters the target cell, followed by reverse transcription of its 
single stranded RNA genome into the double-stranded DNA via a RNA dependent 
DNA polymerase. Hie viral DNA, known as provirus, enters the cell nucleus, where 

25 the viral DNA directs the production of new viral RNA within the nucleus, expression 
of early and late HIV viral proteins, and subsequently the production and cellular 
release of new virus particles. Recent advances in the ability to detect viral load 
within the host shows that the primary infection results in an extremely high 
generation and tissue distribution of the virus, followed by a steady state level of virus 

30 (albeit through a continual viral production and turnover during this phase), leading 
ultimately to another burst of virus load which leads to the onset of clinical AIDS. 
Productively infected cells have a half life of several days, whereas chronically or 
latently infected cells have a 3-week half life, followed by non-productively infected 
cells which have a long half life (over 100 days) but do not significantly contribute to 

35 day to day viral loads seen throughout the course of disease. 
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Destruction of CD4 helper T lymphocytes, which are critical to iixmune 
defense, is a major cause of the progressive immune dysfunction that is the hallmark 
of HIV infection. The loss of CD4 T-cells seriously impairs the body's ability to fight 
most invaders, but it has a particularly severe impact on the defenses against viruses, 

5 fimgi, parasites and certain bacteria, including mycobacteria. 

Effective treatment regimens for HIV-l infected individuals have become 
available recently. However, these drugs will not have a significant impact on the 
disease in many parts of the world and they will have a minimal impact in halting the 
spread of infection within the human population. As is true of many other mfectious 

10 diseases, a significant epidemiologic impact on the spread of HTV-l infection will 
only occur subsequent to the development and introduction of an effective vaccine. 
There are a number of factors that have contributed to the lack of successful vaccine 
development to date. As noted above, it is now apparent that in a chronically infected 
person there existe constant virus production in spite of the presence of anti-HEV-l 

15 humoral and cellular immune responses and destruction of virally infected cells. As 
in the case of other infectious diseases, the outoome of disease is the result of a 

balance.between the kinetics and the magnitude.otthe immune response and the. 

pathogen replicative rate and accessibility to the inunune response. Pre-existing 
immunity may be more successful with an acute mfection than an evolving unmune 

20 response can be with an established infection. A second factor is the considerable 
genetic variabiUty of the virus. Although anti-HIV-1 antibodies exist that can 
neutralize HIV-l infectivity in cell culture, these antibodies are generally virus 
isolate-specific in their activity. It has proven impossible to define serological 
groupings of HIV-l using traditional methods. Rather, the virus seems to define a 

25 serological "continuum" so that individual neutralizing antibody responses, at best, 
are effective against only a handful of vkal variants. Given this latter observation, it 
would be useful to identify inununogens and related delivery technologies that are 
likely to elicit anti-HTV-l cellular immune responses. It is known that in order to 
generate CTL responses antigen must be synthesized wifhm or introduced into cells, 

30 subsequently processed into small peptides by the proteasome complex, and 

translocated into the endoplasmic reticulum/Qolgi complex secretory pathway for 
eventual association with major histocompatibility complex (MHC) class I protems. 
CDS^ T lymphocytes recognize antigen in association with class I MHC via the T cell 
receptor (TCR) and the CDS cell surface protein. Activation of naive CDS* T cells 

35 into activated effector or memory cells generally requires both TCR engagement of 
antigen as described above as well as engagement of costimulatory proteins. Optimal 
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induction of CTL responses usually requires "help" in the form of cytokines from 
CD4^ T lymphocytes which recognize antigen associated with MHC class II 
molecules via TCR and 004 engagement. 

European Patent Applications 0 638 316 (Published February 15, 1995) and 0 
5 586 076 (Published March 9, 1994), (both assigned to American Home Products 
Corporation) describe replicating adenovirus vectors carrying an HFV gene, including 
env or gag. Various treatment regimens were used with chimpanzees and dogs, some 
of which included booster adenovirus or protein plus alum treatments. 

Replication-defectivjB adenoviral vectors harboring deletions in the El region 
10 are known, and recent adenoviral vectors have incorporated the known packaging 
repeats into these vectors; e.g., see EP 0 707 071, disclosing, inter alia, an adenoviral 
vector deleted of El sequences from base pairs 459 to 3328; and U.S. Patent No. 
6,033,908, disclosing, inter alia^ an adenoviral vector deleted of base pairs 459-3510. 
The packaging efficiency of adenovirus has been taught to depend on the number of 
15 incorporated individual A (packaging) repeats; see, e.g., GrSble and Hearing, 1990 /. 
ViroL 64(5):2047-2056; Grable and Hearing, 1992/. ViroL 66(2):723-731. 

Larder, et al., (1987, Nature 327: 716-717) and Larder, et al., (1989, Proc. 
Natl Acad. ScL 86: 4803-4807) disclose site specific mutagenesis of HIV-1 RT and 
the effect such changes have on in vitro activity and infectivity related to interaction 
20 with known inhibitors of RT. 

Davies, et al. (1991, Science 252:, 88-95) disclose the crystal structure of the 
RNase H domain of HIV-1 Pol. 

Schatz, et al (1989, FEBS Lett. 257: 311-3 14) disclose that mutations 
Glu478Gln and His539Phe in a complete HIV-1 RT/RNase H DNA fragment results 
25 in defective RNase activity without effecting RT activity. 

Afizrahi, et al. (1990, Nucl Acids. Res. 18: pp. 5359-5353)* disclose additional 
mutations Asp443Asn and Asp498Asn in the RNase region of the pol gene which also 
results in defective RNase activity. The authors note that the Asp498Asn mutant was 
difficult to characterize due to instability of this mutant protein. 
30 , Leavitt, et al. (1993, J. Biol Chem. 268: 2113-2119) disclose several 

mutations, including a Asp64Val mutation, which show .differing effect on HIV-1 
integrase (IN) activity, 

Wiskerchen, et al. (1995, J. Virol. 69: 376-386) disclose singe and double 
mutants, including mutation of aspartic acid residues which effect HIV-1 IN and viral 
35 replication functions. 
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It would be of great import in the battle against AIDS to produce a 
prophylactic- and/or therapeutic-based HIV vaccine which generates a strong cellular 
immune response against an HIV infection. The present invention addresses and 
meets these needs by disclosing a class of adenovirus vaccines which, upon host 

5 administradon, express codon optimized and modified versions of the BTV-l genes, 
gag, pol and nef. These recombinant, replication-defective adenovirus vaccines may 
be administered to a host, such as a human, alone or as part of a combined modality 
regimen and/or prime-boost vaccination regimen with components of the present 
invention and/or a distinct viral HTV DNA vaccine, non-vu:al HIV DNA vaccine, HIV 

10 subunit vaccine, an HIV whole kiUed vaccine and/or a Uve attenuated HIV vaccine. 

SUMMARY OF THE INVENTION 

The present invention relates to enhanced replication-defective recombinant 
adenovirus vaccine vectors and associated recombinant, replication-deficient 

15 adenovirus vaccines which encode various forms of HIV-1 Gag, HIV-1 Pol, and/or 
HIV-1 Nef, including immunologically relevant modifications of KV-l Gag, HTV-l 
Pol and mV-l Nef. The adenovirus vaccines of the present invention express HIV- 
antigens and provide for improved cellular-mediated immune responses upon host 
administration. Potential vaccinees include but are not limited to primates and 

20 especially humans and non-human primates, and also include any non-human 
mammal of commercial or domestic veterinary importance. An effect of die 
improved recombinant adenovims-based vaccines of the present invention should be a 
lower transmission rate to previously uninfected individuals (i.e., prophylactic 
applications) and/or reduction in the levels of die viral loads witiiin an infected 

25 individual (i.e., therapeutic appUcations), so as to prolong die asymptomatic phase of 
HIV-1 infection. In particular, the present invention relates to adenoviral-based 
vaccines which encode various forms of codon optimized HIV-1 Gag (including but 
in no way limited to p55 versions of codon optimized full lengdi (EL) Gag and tPA- 
Gag fusion proteins), HIV-1 Pol, HIV-1 Nef, and selected modifications of 

30 immunological relevance. The administration, intracellular delivery and expression 
of these adenovirus vaccines elicit a host CTL and Th response. The preferred 
replication-defective recombinant adenoviral vaccine vectors include but are not 
limited to synthetic DNA molecules which (1) encode codon optimized versions of 
wild type HTV-l Gag; (2) encode codon optimized versions of HIV-1 Pol; (3) encode 

35 codon optimized versions of HIV-1 Pol fusion proteins; (4) encode codon optimized 
versions of modified HIV-1 Pol proteins and fusion proteins, including but not limited 
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to pol modifications involving residues within the catalytic regions responsible for 
RT, RNase and IN activity within the host cell; (5) encode codon optimized versions 
of wild type HIV-1 Nef; (6) codon optimized versions of HIV-1 Nef fusion proteins; 
and/or (7) codon optimized versions of HIV-1 Nef derivatives, including but not 
5 limited to ne/modifications involving introduction of an amino-tenninal leader 
sequence, removal of an amino-terminal myristylation site and/or introduction of 
dileucine motif mutations. The Nef-based fusion and modified proteins, disclosed 
within this specification and expressed from an adenoviral-based vector vaccine this 
specification, may possess altered trafficking and/or host cell function while retaining 

10 the ability to be properly presented to the host MHC I complex and in turn elicit a 
host CTL and Th response. Examples of HIV-1 Gag, Pol and/or Nef fusion proteins 
include but are not limited to fusion of a leader or signal peptide at the NH2-teriminal 
portion of the viral antigen coding region. Such a leader peptide includes but is not 
limited to a tPA leader peptide, 

15 The adenoviral vector utilized in construction of the HTV-l Gag-, HIV-1 Pol- 

and/or HTV-l Nef- based vaccines of the present invention may comprise any 
replication-defective adenoviral vector which provides for enhanced genetic stability 
of the recombinant adenoviral genome through large scale production and purification 
of tiie recombinant virus. In other words, an HIV-1 Gag-, Pol- or Nef-based 

20 adenovirus vaccine of the present invention is a purified recombinant, replication- 
defective adenovirus which is shown to be genetically stable through multiple 
passages in ceD culture and remains so during large scale production and purification 
procedures. Such a recombinant adenovirus vector and harvested adenovirus vaccine 
lends itself to large scale dose filling and subsequent worldwide distribution 

25 procedures which will be demanded of an efficacious monovalent or multivalent HIV 
vaccine. The present invention meets this basic requirement with description of a 
replication-defective adenoviral vector and vectors derived therefrom, at least 
partially deleted in El, comprising a wildtype adenovirus m-acting packaging region 
from about base pair 1 to between fxom about base pair 342 (more preferably, 400) to 

30 about base pair 458 of the wildtype adenovirus genome. A preferred embodiment of 
the instant invention comprises base pairs 1-450 of a wildtype adenovirus. In other 
preferred embodiments, the replication -defective adenoviral vector has, in addition 
thereto, a region 3' to the El-deleted region comprising base pairs 351 1-3523. 
Basepairs 342-450 (more particularly. 400-450) constitute an extension of the 

35 5'region of previously disclosed vectors carrying viral antigens, particularly HTV 
antigens (see, e.g., PCT International Application PCT/USOO/18332, published 
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January 11, 2001 (WO 01/02067), which claims priority to U.S. Provisional 
Application Serial Nos. 60/142,631 and 60/148,981, filed 7/6/1999 and 8/13/1999, 
respectively; these documents herein incorporated by reference. Applicants have 
found that extending the 5' region furdier into the El gene into the disclosed vaccine 
5 vectors incorporated elements found to be important in op tim i zin g the packaging of 
the virus. 

As compared to previous vectors not comprising basepairs from about 1 to 
between from about base pair 342 (more preferably, 400) to about base pair 458 of the 
wildtype adenovmis genome, vectors comprising the above region exhibited enhanced 

10 growth characteristics, with approximately 5-10 fold greater amplification rates, a 
more potent virus effect, allowmg lower doses of virus to be used to generate 
equivalent immunity; and a greater cellular-mediated immune response than 
replication-deficient vectors not comprising this region (basepairs 1-450). Even more 
important, adenoviral constructs derived therefrom are very stable genetically in 

15 large-scale production, particularly those con^rising an expression cassette under the 
control of a hCMV promoter devoid of intron A. This is because Applicants have 
- surprisingly found that the intron A-portion-of the-hCMV-promoter constituted a— 
region of instability when employed in adenoviral vectors, Applicants have, 
therefore, identified an enhanced adenoviral vector which is particularly suited for use 

20 in gene therapy and nucleotide-based vacdne- vectors which, favorably, lends itself to 

large scale propagation. 

A pref ared embodiment of this invention is a replication-defective adenoviral 
vector in accordance with the above description wherein the gene is inserted in the 
form of a gene expression cassette comprising (a) a nucleic acid encoding a protein or 
25 biologically active and/or immunologically relevant portion thereof; (b) a 

heterologous promoter operatively linked to the nucleic acid of part a); and, (c) a 

transcription terminator. 

In preferred embodiments, the El gene, oflier than that contained withm 
basepairs 1-450 or, alternatively, that contained within base pairs 1-450 and 3511- 

30 3523 has been deleted firom the adenoviral vector, and the gene expression cassette 
has replaced the deleted El gene. In other preferred embodiments, the replication 
defective adenovirus genome does not have a functional E3 gene, or the E3 gene has 
been deleted. Most preferably, the E3 region is present within the adenoviral genome. 
Further preferred embodiments are wherein the gene expression cassette is in an El 

35 anti-parallel (transcribed in a 3 ' to 5* direction relative to the vector baclAone) 
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orientation or, more preferably, an Bl parallel (transcribed in a 5' to 3' direction 
relative to the vector backbone) orientation. 

Further embodiments relate to a shuttle plasmid vector comprising: an 
adenoviral portion and a plasmid portion, wherein said adenovirus portion comprises: 
5 a) a replication defective adenovirus genome, at least partially deleted in El , 

comprising a wildtype adenovirus ci^-acting packaging region from about base pair 1 
to between from about base pair 342 (more preferably, 400) to about base pair 458 
(preferably, 1-450) of the wildtype adenovirus genome and, preferably, in addition 
thereto, basepairs 3511-3523 of a wildtype adenovirus sequence; and b) a gene 

10 expression cassette comprising: (a) a nucleic acid encoding a protein or biologically 
active and/or immunologically relevant portion thereof; (b) a heterologous promoter 
operatively linked to the nucleic add of part a);and (c) a transcription terminator 
and/or a polyadenylation site. 

Other aspects of this invention include a host cell comprising said adenoviral 

15 vectors and/or said shutde plasmid vectors; vaccine compositions comprising said 
vectors; and methods of producing the vectors comprising (a) introducing the 
adenoviral vector into a host cell which expresses adenoviral El protein, and (b) 
harvesting the resultant adenoviral vectors. 

To this end, the present invention particularly relates to harvested 

20 recombinant, replication defective virus derived from a host cell, such as but not 
limited to 293 cells or PER.C6® cells, including but not limited to harvested virus 
related to any of the MRKAdS vector backbones, with or without an accompanying 
transgene, including but not limited to the HTV-l antigens described herein. An 
HIV-1 vaccine is represented by any harvested, recombinant adenovirus material 

25 which expresses any one or more of the HIV-1 antigens disclosed herein. This 
harvested material may then be purified, formulated and stored prior to host 
administration. 

Another aspect of this invention is a method of generating a cellular immune 
response against a protem in an individual comprising administering to the individual 
30 an adenovirus vaccine vector comprising: 

a) a recombinant, replication defective adenoviral vector, at least partially 
deleted in El, comprising a wildtype adraovirus cw-acting adenovirus packaging 
region from about base pair 1 to between from about base pair 342 (more preferably, 
400) to about base pair 458 (preferably, 1-450) and, preferably in addition thereto, 
35 base pairs 35 1 1-3523 of a wildtype adenovirus sequence, and. 
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b) a gene expression cassette conqmsing:(i) a nucleic add CTCoding a protein 
or biologically active and/or immunologically relevant portion thereof; (ii) a 
heterologous promoter operadvely linked to the nucleic acid of part a); and (iii) a 
transcription terminator and/or a polyadenylation site. 

5 In view of the efficadous nature of the adraioviral and/or DNA plasmid 

vaccines described herein, the present invention relates to all methodology regarding 
administration of one or more of these adenoviral and/or DNA plasmid vaccines to 
provide effective immunoprophylaxis, to prevent establishment of an HIV-1 infection 
following exposure to this virus, or as a post-HTV infection ther^eutic vaccine to 

10 mitigate the acute HEV-l infection so as to result in the establishment of a lower virus 
load widi baaeficial long term consequances. As discussed herein, such a treatment 
regunen may include a monovalent or multivalent composition, various combined 
modality applications, and/or a prime/boost regimen to as to optimize antigen 
expression and a concomitant cellular-mediated and/or humoral immune response 

15 upon inoculatian mto a living vertebrate tissue. TTiMefore, the present invention 

provides for methods of using the adenoviral and/or DNA plasmid vaccines disclosed 
herein within the various parametears disclosed herein as well as any additional- 
parameters known in the art, which, upon introduction into mammalian tissue induces 

intracellular expression of the gag, pol and/or nef-based vacdnes. 
20 To this end, the present invention relates in part to methods of generating a 

cellular immune response in a vacdnee, preferably a human vaccinee, wherein the 
individual is given more than one administration of adenovirus vaccine vector, and it 
may be given in a re^men accompanied by the administration of a plasmid vaccine. 
The plasmid vaccine (also referred to herein as a "DNA plasmid vaccme" or "vaccine 

25 plasmid" comprises a nucleic add encoding a protdn or an immunologically relevant 
portion thereof, a heterologous promoter operably linked to flie nucldc add sequence, 
and a transcription terminator or a polyadenylation signal (such as bOH or SPA, 
respectively). ThNQ may be a predetermined mmimum amount of time separating the 
administrations. The individual can be given a first dose of plasmid vaccine, and then 

30 a second dose of plasmid vaccine. Alternatively, the individual may be given a first 
dose of adenovirus vacdne, and then a second dose of adenovirus vaccine. In otiier 
embodimraits, the plasmid vaccine is administered first, foUowed after a time by 
administration of the adenovirus vaccine. Conversely, the adenovirus vaccine may be 
administered first, followed by administiation of plasmid vaccine after a time. In 

35 these embodiments, an individual may be given multiple doses of the same 

adenovirus serotype in dth«a: viral vector or plasmid form, or the virus may be of 
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differing serotypes. In the alternative, a viral antigen of interest can be first delivered 
via a.viral vaccine other than an adenovirus-based vaccine, and then followed with the 
adenoviral vaccine disclosed. Alternative viral vaccines include but are not limited to 
pox vims and Venezuelan equine encephilitis virus. 
5 The present invention also relates to multivalent adenovirus vaccine 

compositions which comprise Gag, Pol and Nef components described herein; see, 
e.g„ Example 29 and Table 25. Such compositions will provide for an enhanced 
cellular inmoiune response subsequent to host administration, particularly given the 
genetic diversity of human MHCs and of circulating virus. Examples, but not 

10 limitations, include MRKAdS-vector based multivalent vaccine compositions which 
provide for a divalent (i.e., gag and nef, gag and pol, or pol and nef components) or a 
trivalent vaccine (i.e., gag, pol and nef components) composition. Such a mutlivalent 
vaccine may be filled for a single dose or may consist of multiple inoculations of each 
individually filled component; and may in addition be part of a prime/boost regimen 

15 with viral or non-viral vector vaccines as introduced in the previous paragraph. To 
this end, preferred compositions are MR£Ad5 adenovirus used in combination with 
multiple, distinct HEV antigen classes. Each HIV antigen class is subject to sequence 
manipulation, thus providing for a multitude of potential vaccine combinations; and 
such combinations are within the scope of the present invention. The utilization of 

20 such combined modalities vaccine formulation and administration increase the 

probability of eliciting an even more potent cellular immune response when compared 
to inoculation with a single modality regimen. 

The concept of a "combined modality" as disclosed herein also covers the 
altemative mode of administration whereby multiple BDY-l viral antigens may be 

25 ligated into a proper shuttie plasmid for generation of a pre-adenoviral plasmid 
comprising multiple open reading frames. For example, a trivalent vector may 
comprise a gag-pol-nef fusion, in either a E3(-) or E3(+) background, preferably a E3 
deleted backbone, or possibly a "2+1" divalent vaccine, such as a gag-pol fusion (i.e., 
codon optimized p55 gag and inactivated optimized pol; Example 29 and Table 25) 

30 witiiin tiie same MRKAd5 backbone, with each open reading frame being operatively 
linked to a distinct promoter and transcription termination sequence. Alternatively, the 
two open reading frames may be operatively linked to a single promoter, with the 
open reading frames operatively linked by an internal ribosome entry sequence 
(IRES). Therefore, a multivalent vaccine delivered as a single, or possibly a second 

35 harvested recombinant, replication-deficient adenovirus is contemplated as part of the 
present invention. 
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Thetefoie, the adenoviral vaccines and plasmid DNA vaccines of this 
invention may be administered alone, or naay be part of a prime and boost 
administration regimen. A mixed modaHty priming and booster inoculation scheme 
will result in an enhanced immune response, particularly if pre-existing anti-vector 

5 immune responses are present This one aspect of this invention is a method of 
priming a subject with the plasmid vaccine by administering the plasmid vaccine at 
least one time, allowing a predetermined length of tune to pass, and then boosting by 
administering the adenoviral vaccine. Multiple primings typicaUy, 1-4, are usually 
employed, although more may be used. The length of time between priming and 

10 boost may typically vary from about four months to a year, but oflier time frames may 
be used. In experiments with rhesus monkeys, the animals were primed four times 
with plasmid vaccmes, then were boosted 4 months later with the adenoviral vaccine. 
Their cellular immune response was notably higher than that of animals which had 
only received adenoviral vaccine. The use of a priming regbmen may be particularly 

15 preferred in situations where a person has a pre-existing anti-adenovirus immune 
response. 

It is an object of the present invention to provide-for-enhanced- 
replication-defective recombinant adenoviral vaccine vector backbones. These 
recombinant adenoviral backbones may accept one or more transgenes, which may be 
20 passaged throu^ cell culture for growth, amplification and harvest 

It is a furthar object to provide for enhanced rephcation-defective recombinant 
adenoviral vaccine vectors which encode various transgenes. 

It is also an object of the present invention to provide for a harvested 
recombinant, repUcation-defident adenovirus which shows enhanced growth and 
25 amplification rates while in combination with increased virus stability after 

continuous passage in ceU culture. Such a recombinant adenovirus is particularly 
suited for use in gene therapy and nucleotide-based vaccine vectors which, favorably, 
lends itself to large scale propagation. 

To this end, it is an object of the present invention to provide for (1) enhanced 
30 replication-defective recombinant adenoviral vaccine vectors as desaibed hereto 
which encode various forms of HIV-1 Gag, HIV-1 Pol, and/or HIV-1 Nef, including 
immunologicaUy relevant modifications of mV-1 Gag, HIV-1 Pol and HIV-1 Nef, 
and (2) harvested, purified lecombmant replication-deficient adenovirus generated by 
passage of the adenoviral vectors of (1) through one or multiple passages through cell 
35 culture, including but not limited to passage through 293 cells or PER.C6® ceUs. 
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It is also an object of the present invention to provide for recombinant 
adenovirus harvested by one or noiultiple passages through cell culture. As relating to 
recombinant adenoviral vaccine vector, this recombinant virus is harvested and 
formulated for subsequent host administration. 
5 It is also an object of the present invention to provide for replication-defective 

adenoviral vectors wherein at least one gene is inserted in the form of a gene 
expression cassette comprising (a) a nucleic acid encoding a protein or biologically 
active and/or immunologically relevant portion thereof; (b) a heterologous promoter 
operatively linked to the nucleic acid of part a); and, (c) a transcription terminator. 

10 It is also an object of the present invention to provide for a host cell 

comprising said adenoviral vectors and/or said shuttle plasmid vectors; vaccine 
compositions comprising said vectors; and methods of producing the vectors 
comprising (a) introducing the adenoviral vector into a host cell which expresses 
adenoviral El protein, and (b) harvesting the resultant adenoviral vectors. 

15 It is a further object of the presrat invention to provide for methods of genraating a 
cellular immune response against a protein in an individual comprising administering 
to the individual an adenovirus vaccine vector comprising a) a replication defective 
adenoviral vector, at least partially deleted in El, conqnising a wildtype adenovirus 
cw-acting packaging region from about base pair 1 to between from about base^ pair 

20 342 (more preferably, 400) to about 450 (preferably, 1^50) and, preferably, 3511- 
3523 of a wildtype adenovirus sequence, and, b) a gene expression cassette 
comprising:(i) a nucleic acid encoding a protein or biologically active and/or 
immunologically relevant portion thereof; (ii) a heterologous promoter operatively 
linked to the nucleic acid of part a); and (iii) a transcription terminator and/or a 

25 polyadenylation site. 

It is also an object of the present invention to provide various alternatives for 
vaccine administration regimes, namely administration of one or more adenoviral 
and/or DNA plasmid vaccines described herein to provide effective 
immunoprophylaxis for uninfected individuals or a therapeutic treatment for HIV 

30 infected patients. Such processes include but are not limited to multivalent HIV-1 
vaccine compositions, various combined modality regimes as well as various 
prime/boost alternatives. These methods of adnodnistration, relating to vaccine 
composition and/or scheduled administration, will increase the probability of eliciting 
an even more potent cellular immune response when compared to inoculation with a 

35 single modality regimen. 
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As used throughout the specification and claims, the foUowing definitions and 

abbreviations are used: 

"HAART" refers to - highly active antiretroviral therapy 
^fucst generation" vectors are characterized as being replication-defective. 
5 They typically have a deleted or inactivated El gene region, and preferably have a 
deleted or inactivated E3 gene region as well. 

"AEX7* refers to Anion Exchange chroniatography. 
*'QPA" refers to Quick PCR-based Potency Assay, 
•'bps" refers to basepairs. 
10 "s" or "str" denotes that the transgene is in the El parallel or "straight" 

orientation. 

•TBMCs" refers to periphwal blood monocyte cells. 
"FL" refers to full length. 

"FLgag" refers to a full-length optimized gag gene, as shovm in Figure 2. 
15 " Ad5-Flgag" refers to an adenovirus serotype 5 replication deficient virus 

which carries an expression cassette which comprises a full length optimized gag gene 
under the control of a CMV- promoter. _ 

"Promoter" means a recognition site on a DNA strand to which an RNA 
polymerase binds. The promoter forms an initiation complex with RNA polymerase 
20 to initiate and drive transcriptional activity. The complex can be modified by 
activating sequences such as enhancers or inhibiting sequences such as silencers. 

•'Leader" means a DNA sequence at the 5' end of a structural gene which is 
transcribed along with the gene. This usually results a protein having an N-teiminal 
peptide extension, often referred to as a pro-sequences. 
25 "Intron" means a section of DNA occurring in the middle of a gene which 

does not code for an amino acid in the gene product. The precursor RNA of the intron 
is excised and is therefore not transcribed into mRNA not translated into protein. 

^Immunologically relevant" or 'biologically active" means (1) with regards to 
a viral protein, that the protein is capable, upon administration, of eliciting a 
30 measurable immune response within an individual sufficient to retard the propagation 
and/or spread of the virus .and/or to reduce the viral load present within the individual; 
or (2) with regards to a nucleotide sequence, that the sequence is capable of encoding 
for a protein capable of the above. 

"Cassette" refers to a nucleic acid sequence which is to be expressed, along 
35 witii its transcription and translational control sequences. By changing the cassette, a 
vector can express a diflferent sequence. 
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"bGHpA" refers to the bovine growth hormone transcription 
tenninator/polyadenylation sequence. 

"tPAgag" refers to a fusion between the leader sequence of the tissue 
plasminogen activator leader sequence and an optimized HTV gag gene, as 
5 exemplified in Figure 30A-B, whether in a DNA or adenovirus-based vaccine vector. 

Where utilized, "lA" or "inact" refers to an inactivated version of a gene (e.g. 

lApol). 

"MCS" is "multiple cloning site". 

In general, adenoviral constructs, gene constructs are named by reference to 

10 the genes contained therein. For example: 

"Ad5 inV-1 gag", also referred to as the original HIV-1 gag adenoviral 
vector, is a vector containing a transgene cassette composed of a hCMV intron A 
promoter, the full length version of the human codon-optimized HIV-1 gag gene, and 
the bovine growth hormone polyadenylation signal. The transgene was inserted in the 

IS El antiparallel orientation in an £1 and E3 deleted adenovecton 

'*MRK Ad5 mV-l gag" also referred to as *mKAd5gag" or "Ad5gag2" is 
an adenoviral vector taught herein which is deleted of El, comprises basepairs 1-450 
and 3511-3523, and has a human codon-optimized HIV-1 gene in an El parallel 
orientation under the control of a CMV promoter without intron A. The construct 

20 also comprises a bovine growth hormone polyadenylation signal. 

"pVl JnsHIVgag", also referred to as "HIVFLgagPR9901", is a plasmid 
comprising the CMV immediate-early (IE) promoter and intron A, a ftdl-length 
codon-optimized HTV gag gene, a bovine growth hormone-derived polyadenylation 
and transcriptional termination sequence, and a minimal pUC backbone. 

25 "pVl JnsCMV(no intron>-FLgag-bGHpA" is a plasmid derived from 

pVlJnsHIVgag which is deleted of the intron A portion of CMV and which comprises 
the full length HIV gag gene. This plasmid is also referred to as '*pVl JnsHIVgag- 
bGHpA", pVl Jns-hCMV-FL-gag-bQHpA" and "pVlJnsCMV(no intron) + FLgag + 
bGHpA". 

30 **pVl JnsCMV(no intron)-FLgag-SPA" is a plasmid of the same composition 

as pVl JnsCMV(no. intron)-FLgag-bGHpA except that the SPA termination sequence 
replaces that of bGHpA. This plasmid is also referred to as "pVlJns-HTVgag-SPA" 
and pVl Jns-hCMV-ELgag-SPA". 

"pdelElsplA'' is a universal shuttle vector with no expression cassette (i.e., no 

35 promoter or polyA). The vector comprises wildtype adenovirus serotype 5 (Ad5) 
sequences from bp 1 to bp 341 and bp 3524 to bp 5798, and has a multiple cloning 
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site between the Ad5 sequences ending 341 bp and beginning 3524 bp. This plasmid 
is also referred to as the original Ad 5 shuttle vector. 
"MRKpdelElsplA" or "MRKpde]El(Pac/pIX/pack450r or 
"MRKpdelEl(Pac/pIX/pack450)aal" is a universal shutde vector with no expression 
5 cassette (i.e. no promoter or polyA) comprising wildtype adenovirus serotype 5 (Ad5) 
sequences from bpl to bp450 and bp 351 1 to bp 5798. The vector has a multiple 
cloning site between the Ad5 sequence ending 450 bp and beginning 351 1 bp. This 
shutde vector may be used to insert the CMV promoter and the bGHpA fragments in 
both the straight ("str". or El parallel) orientation or in the opposite (opp. or El 

10 antipaiallel) oriratation) 

"MRKpdemi(Pac/pIX/pack450)+CMVmin+BGHi>A(str.)" is still another 

shuttle vector which is the modified vector that contains the CMV promoter (no 
intronA) and the bGHpA fragments. The expression unit containing the hCMV 
promoter (no intron A) and the bovine growth hormone polyadenylalion signal has 
15 been inserted into the shutde vector such that insertion of the gene of choice at a 

unique BgUl site will ensure the direction of transcription of the transgene will be Ad5 
El parallel when inserted into die MRKpAd5(El/E3+)Clalpre-plasmid. This shuttle 
vector, as shown in Figures 22 and 23, was used to insert the respective lApol and 

G2A,LLAA nef genes directly into. 

20 "MRKpdelEl-CMV(no intron)-FLgag-bGHpA" is a shutfle comprising Ad5 

sequences from basepairs M50 and 351 1-5798, with an expression cassette 
containing human CMV without intron A, the fuU-length human codon-optimized 
HIV gag gene and bovine growth hormone polyadenylalion signal. This plasmid is 
also referred to as "MRKpdelEl shuttle +hCMV-FL-gag-BGHpA" 

25 ••MRKpAdHVE3+CMV(no intron)-FLgag-bGHpA" is an adenoviral vector 

comprising all Ad5 sequences except those nucleotides encompassing the El region 
(from 451-3510), a human CMV promoter without intron A, a full-lengfli human 
codon-optimized HIV gag gene, and a bovine growdi hormone polyadenylation 
signal. This vector is also referred to as "MRKpAdHVE3 + hCMV-FL-gag- 

30 BGHpA", "MRKpAd5HIV-lgag", "MRKpAd5gag", "pMRKAd5gag" or 
"pAd5gag2". 

"pVlJns-HIV-pol inact(opt)" or "pVlJns-HIV lA pol (opt) is the inactivated 
Pol gene (contained within SEQ ID N0:3) cloned into the Bgin site of VlJns (Figure 
17A-C). As noted herein, various derivatives of HIV-1 pol may be cloned into a 
35 plasmid expression vector such as VI Jns or VI Jns-tPA, tiius serving direcfly as DNA 
vaccine candidates or as a source for subcloning into an appropriate adenoviral vector. 
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"MRKpdel+ha!^min+FL-pol+bGHpA(s)" is the 
"MRKpdeffil(Pac/pIX/pack450)-i<;MVimn+BGH^ shuttle mentioned above 

which contains the lA pel gene is the proper orientation. This shuttle vector is used in 
a bacterial recombmation with MRKpAd(El-/E3+)ClaL 
5 "MRKpAd+hCMVmin+jPL-pol+bGHpA(S)E3+", also referred to herein as 

"pMRKAdSpol", is the pre-adenovirus plasmid which comprises a CMV-poI 
inact(opt)-pGHpA construct. The construction of this pre-adenovirus plasmid is 
shown in Figure 22. 

"pVl Jns/nef (G2A,LLAA)" or "VlJns/opt nef (G2AJLLAA)" comprises 

10 codon optimized HIV-1 Nef wherein the open reading frame codes for modifications 
at the amino terminal myristylation site (Gly-2 to Ala-2) and substitution of the Leu- 
174-Leu-175 dileucine motif to Ala-174-Ala-175 (SEQ ID NO: 13; which comprises 
an initiatmg methionine residue at nucleotides 12-14 and a 'TAA" stop codon from 
nucleotides 660-662). Hiis fragment is subcloned into the Bgl II site of VI Jns 

15 and/orVlJns-tPA (Figures 16A-B). As noted above for HIV-1 pol, HIV-1 nef 

constructs may be cloned into a plasmid expression vector such as VI Jns or VlJns- 
tPA, thus serving directly as DNA vaccine candidates or as a source for subcloning 
into an appropriate adenoviral vector. 

"MRKpdelElhCMVniinFL-nefBGHpA(s)", also referred to herein as 

20 "pMRKAdSnef is the pre-adenovirus plasmid which comprises a CMV-nef 

(G2A,LLAA) codon optimized sequence. The construction of this pre-adraovirus 
plasmid is shown in Figure 23. 

BRIEF DESCRIPTION OF THE FIGURES 

25 Figure 1 shows the original HIV-1 gag adenovector (Ad5HIV-lgag). This 

vector is disclosed in PCT International AppUcation No. PCT/USOO/18332 (WO 
01/02607) filed July 3, 2000, claiming priority to U.S. Provisional Application Serial 
No. 60/142,631, filed July 6, 1999 and U.S. Application Serial No. 60/148,981. filed 
August 13, 1999, all three applications which are hereby incorporated by reference. 

30 Figure 2 shows the nucleic acid sequence (SEQ ID NO: 29) of the optimized 

human HIV-1 gag open reading frame. 

Figure 3 shows diagrammatically the new transgene constructs in comparison 
with the original gag transgene. 

Rgure 4 shows the modifications made to the original adenovector backbone 

35 in the generation of the novel vectors of the instant invention. 
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Figure 5 shows the virus mixing experiments that were canied out to 
determine the effects of the addition made to the packaging signal region (Expt #1) 
and the E3 gene on viral growth (Expt. #2). The bars denote the region of 
modifications made to the El deletion. 

5 Figure 6 shows an autoradiograph of viral DNA analysis following the viral 

mixing experiments described in Examples 6 and 7. 

Figures 7A, 7B and 7C are as follows: Figure 7A shows, the hCMV-Hgag- 
bGHpA adenovectois constructed within the MRKpAdHVE3 and MRKpAdHVO 
adenovector backbones. Both El parallel and El antiparallel transgene orientation are 

10 lepiesKited. Figure 7B shows the hCMV-Hgag-SPA adenovectors constructed within 
the MRKpAdHVES and MRKpAdHVO adenovector backbones. Again, both El 
parallel and El antiparallel transgene orientation are represented. Figure 7C shows 
the mCMV-Flgag-bGHpA adenovectors constructed within the MRKpAdHVES and 
MRKpAdHVO adenovector backbones. Once again, botti El parallel and El 

15 antiparallel transgene orientation are r^resented. 

Figure 8 A shows the experiment designed to test the effect of transgene 

orientation. - - 

Hgure 8B shows the expaiments designed to test the effect of 

polyadenylation signal. 
20 Figure 9 shows viral DNA team the four adenoviral vectors tested (Example 

12) at P5, follovwng BsiEll digestion. 

Figure 10 shows viral DNA analysis of passages 11 and 12 of 
MRKpAdHVES, MRKAd5HIV-lgag. and MRKAd5fflV-lgagE3-. 

Figure 1 1 shows viral DNA analysis (Hindm digestion) of passage 6 
25 MRKpAdHVES and MRKAdSHTV-lgag used to initiate the viral competition study. 
Hie last two lanes are passage 1 1 analysis of dupUcate passages of the competition 
study (each virus at MOI of 280 viral particles). 

Figure 12 shows viral DNA analysis by HUid TO. digestion on high passage 
numbers for MRKAd5HIV-lgag in serum-containing media with coUections made at 
SO specified times. The first lane shows the Ikb DNA size marker. The other lanes 
represent pre-plasmid control (digested with Pacl and Hindm). MRKAd5HIV-lgag 

atP16,Fl9,andP21. 

Figure 13 shows serum anti-p24 levels at 3 wks post ijn. immunization of 
balb/c mice (n=10) witii varying doses of several Adgag constructs: (A) MRK Ad5 
35 HIV-1 gag (tiirough passage 5); (B) MRKAdS hCMV-FLgag-bGHpA (E3-); (C) 
MRKAdS hCMV-FLgag-SPA CE3+); (D) MRKAdS mCMV-FLgag-bGHpA (E3+); 
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(E) research lot (293 cell-derived) of AdSHTV-l gag; and (F) clinical lot 
(Ad5gagFN0001) of Ad5HIV-l gag. Reported are the geometric mean titers (GMT) 
for each cohort along with the standard error bars. 

Figure 14 shows a restriction map of the pMRKAd5HIV-lgag vector. 
5 Figures 15A-X illustrates the nucleotide sequence of the pMRKAdSHTV-lgag 

vector (SEQ ID NO:27, [coding] and SEQ ID NO:28 [non-coding]). 

Figures 16A-B shows a schematic representation of DNA vaccine expression 
vectors Vllns (A) and VlJns-tPA (B), which are utilized for HTV-l gag, pol and nef 
constructs in various DNA/viral vector combined modality regimens as disclosed 
10 herein. 

Figures 17A-C shows the nucleotide (SEQ ID N0:3) and amino acid sequence 
(SEQ ID NO:4) of lA-PoL Underlined codons and amino acids denote mutations, as 
listed in Table 1. 

Figure 18 shows codon optimized nucleotide and amino acid sequences 
15 through the fusion junction of tPA-poI inact(opt) (contained within SEQ ID NOs: 7 
and 8, respectively). The underlined portion represents the NH2-terminal region of 
lA-PoL 

Figures 19A-B show a nucleotide sequence comparison between wild type 
nefQrfl) and codon optimized nef. The wild type nef gene from the jrfl isolate 
20 consists of 648 nucleotides capable of encoding a 216 amino acid polypeptide. WT, 
wild type sequence (SEQ ID NO: 19); opt, oodon-optimized sequence (contained 
within SEQ ID NO:l). The Nef amino acid sequence is shown in one-letter code 
(SEQ ID N0:2). 

Figures 20A-C show nucleotide sequences at junctions between nef coding 
25 sequence and plasmid backbone of nef expression vectors VI Jns/nef (Figure 20A), 

VlJns/nef(G2A,IlAA) (Figure 20B), VlJns/tpanef (Figure 20C) and 

VI Jns/tpanef(LLAA) (Figure 20C, also). 5' and 3' flanking sequences of codon 

optimized nef or codon optimized nef mutant genes are mdicated by bold/italic letters; 

nef and nef mutant coding sequences are indicated by plain letters. Also indicated (as 
30 underlined) are the restriction endonuclease sites involved in construction of 

respective nef expression vectors. VlJns/tpanef and VlJns/tpanef(LLAA) have 

identical sequences at the junctions. 

Kgure 21 shows a schematic presentation of nef and nef derivatives. Amino 

acid residues involved in Nef derivatives are presented. Glycine 2 and Leucinel74 
35 and 175 are the sites involved in myristylation and dileucine motif, respectively. For 

both versions of the tpanef fusion genes, the. putative leader peptide cleavage sites are 
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indicated with "*", and a exogenous serine residue introduced during the construction 

of the mutants is underlined 

Figure 22 shows diagramnaatically the construction of the pre-adenovirus 

plasmid construct, MRKAdSPol. 
5 Figure 23 shows diagrammatically the construction of the pre-adenovirus 

plasmid construct, MRKAdSNef . 

Figure 24 shows a comparison of clade B vs. clade C anti-gag T cell responses 

in clade B HIV-infected subjects. 

Figure 25 shows a comparison of clade B vs. clade C anti-nef T cell responses 

10 in clade B HIV-infected subjects. 

Figures 26A-AO illustrates the nucleotide sequence of the pMRKAdSHIV- 
Ipol adenoviral vector (SEQ ID NO:32 [coding] and SEQ ID NO:33 [non-coding]), 
comprising the coding region of the inactivated pol gene (SEQ ID N03). 

Figures 27A-AM illustrates the nucleotide sequence of Ae pMRKAdSHEV-l 
15 nef adenoviral vector (SEQ ID NO:34 [coding] and SEQ ID NO:35 [non-coding]), 
comprising the coding region of the inactivated pol gene (SEQ ID NO 13). 

- Figure 28 shows the stability of MRKAdS vectors comprising various 
promoter fragments (hCMV or mCMV) and terminations signals (bGH or SPA) in 

E3(+) or E3(-) backbones. 

20 Figures 29 A and B shows the anion-exchange HPLC viral particle 

concentrations of the freeze-thaw recovered cell associated virus at the 24, 36, 48, and 
60 hpi time points (Figuie 29A) and the timcourse QPA supernatant titers (Figure 
29B) for MRKAdSgag, MRKAdSpol and MRKAdSnef . 

Figure 30 shows the nucleotide sequence (SEQ ID NO:36) and amino acid 

25 sequence (SEQ ID NO:37) comprising the open reading frame of a representative 
tPA-gag fusion for use in the DNA and/or adenoviral vaccine disclosed herein. 

Figure 31 shows the intracellular ylFN staining of PBMCs collected at week 
10 (post DNA prime) and week 30 (post Ad boost). The cells were stimulated 
overnight in the presence or absence of the gag peptide pool. They were subsequently 

30 stained using fluorescence-tagged anti-CD3, anti-CD8, anti-CD4, and anti-ylFN 

monoclonal antibodies. Each plot shows aU CD3+ T cells which were segregated in 
terms of positive staining for surface CD8 and ylFN production. The numbers in the 
upper right and lower right quadrants of each plot are the percentages of CD3'' cells 
that were CDS^ylPl>t and CD4^yIFN^, respectively. 

35 Figure 32 shows a comparison of sin^e-modality adenovirus incmunization 

with DNA + adjuvant prime/adraovirus boost immunization. 
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Figures 33A-B show the nucleotide sequence (SEQ ID NO: 38) of the open 
reading frame for the gag-IApol fusion of Example 29. 

Figures 34A-B show the protein sequence (SEQ ID NO: 39) of the gag-IApol 
fustion frame. 

5 

DETAILED DESCRIPTION OF THE INVENTTON 

A novel replication-defective, or "first generation /' adenoviral vector suitable 
for use in gene therapy or nucleotide-based vaccine vectors is described. This vector 
is at least partially deleted in El and comprises a wildtype adenovirus cw-acting 

10 packaging region from about base pair 1 to between about base pair 342 (more 
preferably, 400) to about 458 (preferably, 1-450) and, preferably, 3511-3523 of a 
wild-type adenovirus sequence. It has been found that a vector of this description 
possesses enhanced growth characteristics, with approximately 5-10 fold greater 
amplification rates, and is more potent allowing lower doses of virus to be used to 

15 generate equivalent immunity. The vector, furthermore, generates a harvested 
recombinant adenovirus which shows greater cellular-mediated immune responses 
than replication-deficient vectors not comprising this region (basepairs 342-450). 
Adenoviral constructs derived from these vectors are, further, very stable genetically, 
particulady those comprising a transgene under the control of a hCMV promoter 

20 devoid of intron A. Viruses in accordance with this description were passaged 

continually and analyzed; see Example 12. Each virus analyzed maintained it correct 
genetic structure. Analysis was also carried out under propagation conditions similar 
to that performed in large scale production. Again, the vectors were found to possess 
enhanced genetic stability; see Figure 12. Following 21 passages, the viral DNA 

25 showed no evidence of rearrangement, and was highly reproducible from one 
production lot to the next The outcome of all relevant tests indicate that the 
adenoviral vector is extremely well suited for large-scale production of recombinant, 
rephcation-deficient adenovirus, as shown herein with the data associated with Figure 

28. 

30 A preferred adenoviral vector in accordance with this description is a vector 

comprising basepairs 1-450, which is deleted in E3. This vector can accommodate up 
to approximately 7,500 base pairs of foreign DNA inserts (or exogenous genetic 
material). Another preferred vector is one retaining E3 which comprises basepairs 1- 
450. A preferred vector of this description is an E3+ vector comprising basepairs 1- 

35 450 and 351 1-3523. This vector, when deleted of the region spanning basepairs 451- 
3510, can accommodate up to approximately, 4,850 base pairs of foreign DNA inserts 
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(or exogenous genetic material). The cloning capacities of the above vectors have 
been determined using 105% of the wildtype Ad5 sequence as the upper genome size 
limit 

Wildtype adenovirus serotype 5 is used as the basis fOT the specific basepair 

5 numbers provided throughout the specification. The wildtype adenovirus serotype 5 
sequence is known and described in the art; see, Chroboczek et al., 1992 /. Virology 
186:280, which is hereby incorporated by reference. Accordingly, a particular 
embodiment of the instant invention is a vector based on the adenovirus serotype 5 
sequence. One of skill in the art can readily identify the above regions in other 

10 adenovirus serotypes (e.g.. serotypes 2, 4, 6, 12. 16, 17, 24, 31, 33, and 42). regions 
defined by basepairs corresponding to the above basepair positions given for 
adenovirus serotype 5. Accordingly, the instant invention encompasses all adenoviral 
vectors partially deleted in El comprising basepairs corresponding to 1-450 
(particularly. 342-450) and. preferably, 3511-3523 of a wild-type adenovirus serotype 

15 5 (Ad5) nucleic add sequence. Particularly preferred embodiments of the instant 
invraition are those derived from adenoviruses like Ad5 which are classified in 

subgroup C (e.g.^ Ad2). - — 

Vectors in accordance with the instant invention are at least partially deleted in 
El. Preferably the El region is completely deleted or inactivated. Most prefwably, 

20 the region deleted of El is within basepairs 451-3510. It is to be noted that the 

extended 5' and 3' regions of the disclosed vectors are beUeved to effectively reduce 
the size of the El deletion of previous constructs without overlapping any part of the 
EIA/EIB gene present in the cell line used, i.e., the PER.C6® cell line transefected 
with base pairs 459-3510. Overlap of adenoviral sequences is avoided because of Ae 

25 possibiUty of recombination. One of ordinary skiU in the art can certainly appreciate 
that the instant invention can, therefore, be modified if a different cell line transfected 
with a different segment of adenovirus DNA is utilized. For purposes of 
exemplification, a 5' region of base pairs 1 to up to 449 is more appropriate if a ceU 
line is transfected with adenoviral sequence from base pairs 450-35 10. This holds 

30 true as well in the consideration of segments 3' to fee El deletion. 

Preferred embodiments of the instant invention possess an intact E3 region 
(i.e., an E3 gene capable of encoding a functional E3). Alternate embodiments have a 
partially deleted E3, an inactivated E3 region, or a sequence completely deleted of E3. 
Applicants have found, in accordance with the instant invention, that virus comprising 

35 the E3 gene were able to amplify more rapidly compared with virus not comprising an 
E3 gene; see Figure 6 wherein a diagnostic CsQ band corresponding to the E3+ virus 
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tested (5,665 bp) was present in greater amount compared with the diagnostic band of 
3,010 bp corresponding to the E3- virus. These results were obtained following a 
virus competition study involving mixing equal MOI ratio (1:1) of adenovectors both 
comprising the E3 gene and not comprising the E3 gene. This increased amplification 
5 capacity of the E3+ adenovectors was subsequently confirmed with growth studies; 
see Table 4A, wherein the E3+ virus exhibit amplification ratios of 470, 420 and 320 
as compared with the 1 15 and 40-50 of the E3- constructs. 

As stated above, vectors in accordance with the instant invention can 
accommodate up to approximately 4,850 base pairs of exogenous genetic material for 

10 an E3+ vector and approximately 7,500 base pairs for an E3- vector. Preferably, the 
insert brings the adenoviral vector as close as possible to a wild-type genomic size 
(e.g., for Ad5, 35,935 basepairs). It is well known that admovirus amplifies best 
when they are close to their wild-type genomic size. 

The genetic material can be inserted in an El-parallel or an El anti-parallel 

15 orientation, as such is illustrated in Figure 7A, 7B, 7C and Figure 8A. Particularly 
preferred embodiments of the instant invention, have the insert in an El-parallel 
orientation. Applicants have found, via competition experiments with plasmids 
containing transgenes in differing orientation OPigure 8A), that vector constructs with 
the foreign DNA insert in an El-parallel orientation amplify better and actually out- 

20 compete El-antiparallel-oriented transgenes. Viral DNA analysis of the mixtures at 
passage 3 and certainly at passage 6, showed a greater ratio of the virus carrying the 
transgene in the El parallel orientation as compared with the El anti-parallel version. 
By passage 10, the only viral species observed was the adenovector with the transgene 
in the El parallel orientation for both transgenes tested. 

25 Adenoviral vectors in accordance with the instant invention are particularly 

well suited to effectuate expression of desired proteins, one example of which is an 
HIV protein, particularly an HIV full length gag protein. Exogenous genetic material 
encoding a protein of interest can exist in the form of an expression cassette. A gene 
expression cassette preferably comprises (a) a nucleic acid encoding a protein of 

30 interest; (b) a heterologous piromoter operatively linked to the nucleic acid encoding 
the protein; and (c) a transcription terminator. 

The transcriptional promoter is preferably recognized by an eukaryotic RNA 
polymerase. Jn a preferred embodiment, the promoter is a "strong" or "efficient" 
promoter. An example of a strong promoter is the immediate early human 

35 cytomegalovirus promoter (Chapman et al, 1991 NucL Adds /?ejl9:3979-3986, which 
is incorporated by reference), preferably without intronic sequences. Most preferred 



23 



wo 02/22080 



PCT/USOl/28861 



for use within the instant adenoviral vector is a human CMV promoter without 
intronic seqeunces, like intron A. AppUcants have found that mtron A, a portion of 
the human cytomegalovirus promoter (hCMV), constitotes a region of instabiUty for 
adenoviral vectors. CMV without intron A has been found to effectuate (Examples 1- 

5 3) comparable expression capabiUties in vitro when driving HIV gag expression and. 
furthermore, behaved equivalently to intron A-containing constructs in Balb/c mice in 
vivo with respect to their antibody and T-ceU responses at both dosages of plasmid 
DNA tested (20 jjg and 200 pg). Those skilled in the art will appreciate that any of a 
number of other known promoters, such as the strong immunoglobulin, or other 

10 eukaryotic gene promotes may also be used, including the EFl alpha promoter, the 
murine CMV promoter. Rous sarcoma virus (RSV) promoter. SV40 early/late 
promoters and the beta-actin promoter. 

In preferred embodiments, the promoter may also comprise a regulatable 
sequence such as the Tet operator sequence. This would be extremely useful, for 

15 example, in cases where the gene products are effecting a result other than that 

desired and repression is sought. 

- Preferred transcription termination sequences present within the gene 

expression cassette are the bovine growth hormone terminator/polyadenylation signal 
(bGHpA) and the short synthetic polyA signal (SPA) of 50 nucleotides in length, 
defined as follows: AATAAAAGATCnTATnTCATrAGATCTGTGTGTTGGT- 

TTTTTGTOTG (SEQ ID NO:26). 

The combination of the CMV promoter (devoid of the intron A region) with 
the BGH terminator is particularly preferred although other promoter/terminator 
combinations in the context of FG adenovirus may also be used. 
25 Other embodiments incorporate a leader or signal peptide into the transgene. 

A preferred leader is that from the tissue-specific plasminogen activator protein, tPA. 
Examples include but are not limited to the various tPA-gag, tPA-pol and tPA-nef 
adenovirus-based vaccines disclosed throughout this specification. 

In view of the improved adenovirus vectors described herein, an essential 
30 portion of the present invention are adenoviral-based HIV vaccines comprising said 
adenovinis backbones which may be administered to a mammalian host, preferably a 
human host, in either a prophylactic or therapeutic setting. The HIV vaccines of the 
present invention, whether administered alone or in combination regimens with otiier 
viral- or non-viral-based DNA vaccines, should eUcit potent and broad ceUular 
35 immmie responses against HIV that will either lessen the likelihood of persistent virus 
infection and/or lead to the establishment of a clinically significant lowered virus load 
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subject to HIV infection or in combination with HAART therapy, mitigate the effects 
of previously established HIV infection (antiviral immunotherapy(ARI)). While any 
HIV antigen (e.g., gag, pol, nef, gpl60, gp41, gpl20, tat, rev, etc.) may be utilized in 
the herein described recombinant adenoviral vectors, preferred embodiments include 
5 the codon optimized p55 gag antigen (herein exemplified as MRElAdSgag), pol and 
nef. Sequences based on different Clades of HIV-1 are suitable for use in the instant 
invention, most preferred of which are Clade B and Clade C. Particularly preferred 
embodiments are those sequences (especially, codon-optimized sequences) based on 
concensus Clade B sequences. Preferred versions of the MRKAdSpol and 

10 MRKAdSnef series of adenoviral vaccines will encode modified versions of pol or 
nef, as discussed herein. Preferred embodiments of the MRKAd5HIV-l vectors 
carrying HTV envelope genes and modifications thereof comprise the HIV codon- 
optimized env sequences of PCT International Applications PCT/US97/02294 and 
PCTAJS97/10517, published August 28, 1997 (WO 97/31115) and December 24, 

15 1997, respectively; both documents of which are hereby incorporated by reference. 

A most preferred aspect of the instant invention is the disclosed use of the 
adenoviral vector described above to effectuate expression of HTV gag. Sequences 
for many genes of many HIV strains are publicly available in GENB ANK and 
primary, field isolates of HIV are available from the National Institute of Allergy and 

20 Infectious Diseases (NIAID) which has contracted with Quality Biological 

(Gaithersburg, MD) to make these strains available. Strains are also available from 
the World Health Organization (WHO), Geneva Switzerland. It is preferred that the 
gag gene be from an HIV-1 strain (CAM-1; Myers et al, eds. "Human Retroviruses 
and AIDS: 1995, IIA3-IIA19, which is hereby incorporated by reference). This gene 

25 closely resembles the consensus amino acid sequence for the clade B (North 
American/European) sequence. Therefore, it is within the purview of the skilled 
artisan to choose an appropriate nucleotide sequence which encodes a specific HTV 
gag antigen, or immunologically relevant portion thereof. As shown in Example 25, a 
clade B or clade C based p55 gag antigen will potentially be useful on a global scale. 

30 As noted herein, the transgene of choice for insertion in to a DNA or MRKAd-based 
adenoviral vector of the present invention is a codon optimized version of p55 gag. 
Such a MRKAd5gag adenoviral vector is documented in Example 11 and is at least 
referred to herein as MRKAdSHTV-lgag. Of course, additional versions are 
contemplated, including but not limited to modifications such as promoter (e.g., 

35 mCMV for hCMV) and/or pA-terminations signal (SPA for bGH) switching, as well 
as generating MRK Ad5 backbones with or without deletion of the Ad5 E3 gene. 

25 
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The present invention also relates a series of MRKAdSpol-based adenoviral 
vaccines which are shown herein to generate cellular immune responses subsequent to 
administration in mice and non-human primate studies. Several of the MRKAdSpol 
series are exemplified herein. One such adenoviral vector is referred to as 
5 MRKAdShCMV-inact opt pol(E3+), which comprises the MRKAdS backbone, the 
hCMV promoter (no intron A), an inactivated pel transgene, and contains the Ad5 E3 
gene in the adenoviral backbone. A second exemplified pre-adenovirus plasmid and 
concomitant virus is referred to as MRKAdShCMV-inact opt pol(E3-), which is 
identical to the former adenoviral vector except that the E3 is deleted. Both 
10 constructions contain a codon optimized, inactivated version of HIV-1 Pol, wherein it 
least the entire coding region is disclosed herein as SEQ ID N0:3 and the expressed 
protein is shown as SEQ ID N0:4 (see also Figure 17A-C and Table 1, which show 
targeted deletion for inactivated pol. This and other preferred codon optimized 
versions of HIV Pol as disclosed herein are essentially as described in U.S. 
15 Apphcation Serial No. 09/745,221. filed December 21. 2000 and PCT International 
AppUcation PCr/USOO/34724. also filed December 21. 2000, both documents which 
are hereby incorporated by reference.- As disclosed in the above-mentioned 
documents, the open reading frame for these codon-optimized HIV-l Pol-based DNA 
vaccines are represented by codon optimized DNA molecules encoding codon 
20 optimized HIV-1 Pol (e.g. SEQ ID N0:2), codon optimized HIV-1 Pol fused to an 
amino terminal localized leader sequence (e.g. SEQ ID NO:6), and especially 
preferable, and exemplified by the MRKAdS-Pol construct in e.g.. Example 19, 
biologicaUy inactivated pol ("inact opt Pol"; e.g.. SEQ ID N0:4) which is devoid of 
significant PR, RT, RNase or IN activity associated with wild type Pol. In addition, a 
25 construct related to SEQ ID NO:4 is contemplated which contains a leader peptide at 
the amino terminal region of the lA Pol protein. A specific constinict is Ugated within 
an appropriate DNA plasmid vector containing regulatory regions operatively Unked 
to the respective HIV-1 Pol coding region, with or without a nucleotide sequence 
encoding a functional leader peptide. To this end, various HIV-1 Pol constructs 
30 disclosed herein relate to open reading fiames for cloning to the enhanced first 
generation Ad vectors of tiie present invention (such a series of MRKAdSpol 
adenoviral vaccine vectors), including but not limited to wild type Pol (comprising the 
DNA molecule encoding WT opt Pol, as set forth in SEQ ID N0:2), tPA-opt WTPol, 
(comprising the DNA molecule encoding tPA Pol, as set forth in SEQ ID N0:6). inact 
35 opt Pol (comprising the DNA molecule encoding lA Pol, as set forth in SEQ ID 

N0:4), and tPA-inact opt Pol, (comprising the DNA molecule encoding tPA-inact opt 
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Pol, as set forth in SEQ ID NO: 8). The pol-based versions of enhanced first 
generation adenovirus vaccines elicit CTL and Th cellular immune responses upon 
administration to the host, including prinaates and especially humans. As noted in the 
above, an effect of the cellular immune-directed vaccines of the present invention 
5 should be a lower transmission rate to previously uninfected individuals and/or 

reduction in the levels of the viral loads within an infected individual, so as to prolong 
the asymptomatic phase of HIV-1 infection. 

The present invention further relates to a series of MRKAd5nef-based 
adenoviral vaccines which, similar to HIV gag and pol antigens, generate cellular 

10 immune responses subsequent to adnainistration in mice and non-human primate 
studies. The MRKAdSnef series are exemplified herein by utilizing the improved 
MRK adenoviral backbone in combination with modified versions of HEV nef . These 
exemplified MRKAdSnef vectors are as follows: (1) MRKAdShCMV- 
nef(G2A,LLAA) (E3+), which comprises die improved MRKAdS backbone, a human 

15 CMV promoter an intact Ad5 E3 gene and a modified nef gene: (2) MRKAdSmCMV- 
nef(G2AJLLAA) (E3+), which is the same as (1) above but substituting a murine 
CMV promoter for a human CMV promoter; and (3) MRKAd5mCMV-tpanef (LLAA) 
(E3+), which is the same as (2) except that the nef transgene is tpanef(LLAA). Codon 
optimized versions of HIV-1 Nef and HTV-l Nef modifications are essentially as 

20 described in U.S. AppUcation Serial No. 09/738,782, filed December 15, 2000 and 
PCT Intemational Application PCT/USOO/34162, also filed December 15, 2000, both 
documents which are hereby incorporated by reference. Particular embodiments of 
codon optimized Nef and Nef modifications relate to a DNA molecule encoding HIV- 
1 Nef ftom the HIV-1 jfrl isolate wherein the codons are optimized for expression in a 

25 mammalian system such as a human. The DNA molecule which encodes this protein 
is disclosed herein as SEQ ID N0:9, while the expressed open reading frame is 
disclosed herein as SEQ ID NO: 10. Another embodiment of Nef-based coding 
regions for use in the adenoviral vectors of the present invention comprise a codon 
optimized DNA molecule encoding a protein containing the human plasminogen 

30 activator (tpa) leader peptide fused with the NH2-terminus of the HIV-1 Nef 

polypeptide. The DNA molecule which encodes this protein is disclosed herein as 
SEQ ID NO: 1 1, while the expressed open reading frame is disclosed herein as SEQ 
ID NO: 12. Another modified Nef optimized coding region relates to a DNA molecule 
encoding optinodzed HIV-1 Nef wherein the open reading frame codes for 

35 modifications at the amino terminal myristylation site (Gly-2 to Ala-2) and 

substitution of the Leu-174-Leu-175 dileucine motif to Ala-174-Ala-175, herein 
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described as opt nef (G2A, LLAA). The DNA molecule which encodes this protein is 
disclosed herein as SEQ JD N0:13, while the expressed open reading frame is 
disclosed herein as SEQ ID NO: 14. MRKAdSnef vectors (1) MRKAdShCMV- 
nef (G2A,LLAA) (E3+) and (2) MRKAd5mCMV-nef(G2A JIAA) (E3+) contain this 

5 transgene. An additional embodiment relates to a DNA molecule encoding optimized 
HEV-l Nef wherein the amino terminal myristylation site and dileucine motif have 
been deleted, as well as comprising a tPA leader peptide. This DNA molecule, opt 
tpanef (LLAA), comprises an open reading frame which encodes a Nef protein 
containing a tPA leader sequence fused to amino acid residue 6-216 of HIV-1 Nef 

10 (jfrl), wherein Leu-174 and Leu-175 are substituted with Ala-174 and Ala-175, herein 
referred to as opt tpanef (LLAA) is disclosed herein as SEQ ID NO:15, while the 
expressed open reading frame is disclosed herein as SEQ ID NO: 16. The 
MRKAdSnef vector "MRKAd5mCMV-tpanef(LLAA) (E3+)" contains this transgene. 
Along with the improved MRKAdSgag adenovirus vaccine vector described 

15 herein, generation of a MRKAdSpol and MRKAdSnef adenovirus vector provide for 
enhanced HIV vaccine capabilities. Namely, the generation of this trio of adenoviral 
vaccine vectors? aU shown to generate efifective cellular immune responses subsequent 
to host administration, provide for the ability to administer these vaccine candidates 
not only alone, but preferably as part of a divalent Ci.e., gag and nef, gag and pol, or 

20 pol and nef components) or a trivalent vaccine (i.e., gag, pol and nef components). 
Therefore, a preferred aspect of the present invention are vaccine formulations and 
associated methods of administration and concomitant generation of host cellular 
immune responses associated willi formulating three separate series of MRKAdS- 
based adenoviral vector vaccines. Of course, this MRKAdS vaccine series based on 

2S distinct HIV antigens promotes expanded opportunities for formulation of a divalent 
or trivalent vaccine, or possibly administration of separate formulations of one or 
more monovalent or divalent formulations within a reasonable window of time. It is 
also within the scope of the presMit invention to embadc on combined modality 
regimes which include multiple but distinct components from a specific antigen. An 

30 example, but certainly not a limitation, would be s^iarate MRKAdSpol vectors, with 
one vaccine vector expressing wild type Pol (SEQ ID N0:2) and another 
MRKAdSpol vector expressing inactivated Pol (SEQ ID N0:6). Another example 
might be separate MRKAdSnef vectors, with one vacdne vector expressing the 
tPA/LLAA version of Nef (SEQ ID NO: 16) and another MRKAdSnef vector 

35 expressing the G2AJLLAA modified version of Nef (SEQ ID N0:14). Therefore, the 
MRKAdS adenoviral vectors of the present invention may be used in combination 
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with multiple, distinct HIV antigen classes. Each HIV antigen class is subject to 
sequence manipulation, thus providing for a multitude of potential vaccine 
combinations; and such combinations are within the scope of the present invention. 
The utilization of such combined modalities vaccine foimulation and administration 
5 increase the probability of eliciting an even more potent cellular immune response 
when compared to inoculation with a single modality legimeii. 

The present invention also relates to application of a mono-, dual-, or tri- 
modality administration regime of the MRKAd5gag, pol and nef adenoviral vaccine 
series in a prime/boost vaccination schedule. This prime/boost schedule may include 

10 any reasonable combination of the MRKAdSgag, pol and nef adenoviral vaccine 

series disclosed herein. In addition, a prime/boost regime may also involve other viral 
and/or non-viral DNA vaccine^. A preferable addition to an adenoviral vaccine 
vector regime includes but is not limited to plasmid DNA vaccines, especially DNA 
plasmid vaccines that contain at least one of the codon optimized gag, pol and nef 

1 5 constructions, as disclosed herein. 

Therefore, one aspect of this invention is the administration of the adenoviral 
vector containing the optimized gag gene in a prime/boost regiment in conjunction 
with a plasmid DNA encoding gag. To distinguish this plasmid from the adenoviral- 
containing shuttle plasmids used in the construction of an adenovirus vector, this 

20 plasmid will be referred to as a "vaccine plasmid" or "DNA plasmid vaccine". 
Preferred vaccine plasmids for use in this administration protocol are disclosed in 
pending U.S. patent application 09/017,981, filed February 3, 1998 and WO98/34640, 
published August 13, 1998, both of which are hereby incorporated by reference. 
Briefly, the preferred vaccine plasmid is designated VI Jns-FLgag, which expresses 

25 the same codon-optimdzed gag gene as the adenoviral vectors of this invention (see 
Figure 2 for the nucleotide sequence of the exemplified optimized codon version of 
full length p55 gag). The vaccine plasmid backbone, designated VlJns contains the 
CMV immediate-early (IE) promoter and intron A, a bovine growth homione-derived 
polyadenylation and transcription termination sequence as the gene expression 

30 regulatory elements, and a minimal pUC backbone; see Montgomery et al, 1993, 
DNA Cell Biol. 12:777-783. The pUC sequence permits high levels of plasmid 
production in E, coli and has a neomycin resistance gene in place of an ampiciUin 
resistance gene to provide selected growth in the presence of kanamycin. 
Altematively, a vaccine plasmid which has the CMV promoter deleted of intron A can 

35 be used. Those of skill in the art will recognize that alternative vaccine plasmid 
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vectors may be easily substituted for these specific constructs, and this invention 
specificaUy envisions use of such alternative plasmid DNA vaccine vectors. 

Another aspect of the present invention is a prime/boost regimen which 
includes a vaccine plasmid which encodes an HIV pol antigpn, preferably a codon 
5 optimized form of pol and also preferably a vaccine plasmid which comprises a 
nucleotide sequence which encodes a Pol antigen selected firom flie group of Pol 
antigens as shown in SEQ ID NOs: 2. 4. 6 and 8. The variety of potential DNA 
plasmid vaccines which encode various biologically active forms of KV-l Pol, 
wherein administration, intracellular delivery and expression of the HIV-1 Pol gene of 
10 interest eUcits a host CTL and Th response. The preferred synthetic DNA molecules 
of the present invention encode codon optimized wild type Pol (without Pro activity) 
and various codon optimized inactivated HIV-1 Pol proteins. The HIV-1 pol open 
reading disclosed herein are especially preferred for pharmaceutical uses, especially 
for human administration as deUvered via a recombinant adenoviral vacdne, 
15 espedaUy an enhanced first generation recombmant adenoviral vaccine as described 
herein. Several embodiments of this portion of the invention are providedm detail 
- below, namely DNA molecules which comprise a HIV-1 pol open reading frame, 
whether encoding fiiU length pol or a modification or fusion as described herein, 
wherein the codon usage has been optimized for expression in a mammal, especially a 
20 human. Again, these DNA sequaices are positioned appropriately within a 

recombinant adenoviral vector, such as the exempUfied recombinant adenoviral vector 
described herein, so as to promote expression of the respective HIV-1 Pol gene of 
interest, and subsequent to administration, eUcit a host CTL and Th response. Again, 
• these preferred, but in no way limiting, pol genes are as disclosed herein and 
25 essentially as described in U.S. AppUcation Serial No. 09/745,221. filed December 
21. 2000 and PCI International AppUcation PCT/USOO/34724, also filed December 
21. 2000, both documents which are hereby incorporated by reftasence. 

A third series of vaccine plasmids which are useful in a combined modality 
and/or prime/boost regimen are vaccine plasmids which encode an HIV nef antigen or 
30 biologically and/or immunologically relevant modification thereof. As noted 

elsewhere, preferred vaccine plasmids contain a codon optimized form of nef and also 
preferably comprise a nucleotide sequence which encodes a Nef antigen selected fix>m 
the group of Nef antigens as shown in SEQ ID NOs: 10, 12. 14 and 16. These 
preferred nef coding regions are disclosed herein, as weU as being described in U.S. 
35 Application Serial No. 09/738.782. filedDecember 15, 2000 and PCT International 
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Application PCT/USOO/34162, also filed December 15, 2000. both documents which 
are hereby incorporated by reference. 

Therefore, the adenoviral vaccines and plasmid DNA vaccines of this 
invention may be administered alone, or may be part of a prime and boost 
S administration regimen. A mixed modality priming and booster inoculation scheme 
will result in an enhanced immune response, particularly is pre-existing anti-vector 
immune responses are present. This one aspect of this invention is a method of 
priming a subject with the plasmid vaccine by administering the plasmid vaccine at 
least one time, allowing a predetermined length of time to pass, and then boosting by 

10 administering the adenoviral vaccine. Multiple primings typically, 1-4, are usually 
employed, although more may be used. The length of time between priming and 
boost may typically vary from about four months to a year, but other time frames may 
be used. In experiments with rhesus monkeys, the animals were primed four times 
with plasmid vaccines, then were boosted 4 months later with the adenoviral vaccine. 

IS Their cellular immune response was notably higher than that of animals which had 
only received adenoviral vaccine. The use of a priming regimen may be particularly 
preferred in situations where a person has a pre-existing anti-adenovirus immune 
response. 

Furthemiore and in the altemative, multiple HTV-l viral antigens, such as the 

20 MRKAdS adenoviral vaccines disclosed herein, may be ligated into a proper shuttle 
plasnoid for generation of a pre-adenoviral plasmid comprising multiple open reading 
frames. For example a trivalent vector may comprise a gag-pol-nef ftision, in either a 
E3(-) or E3(+) background, preferably a E3 deleted backbone, or possible a "2+1" 
divalent vaccine, such as a gag-pol fusion (i.e., codon optimized pS5 gag and 

25 inactivated optimized pol; Example 29 and Table 25) within the same MRKAd5 
backbone, with each open reading fr^e being operatively linked to a distinct 
promoter and transcription termination sequence. Alternatively, the two open reading 
frames may be operatively linked to a single promoter, with the open reading frames 
operatively linked by an internal ribosome entry sequence (IRES), as disclosed in 

30 International Publication No. WO 95/24485, which is hereby incorporated by 
reference. Figure 9 shows that the use of multiple promoters and termination 
sequences provide for similar growth properties, while Figure 28 shows that these 
MRKAd5gag-based vectors are also stable at least through passage 21. In the absence 
of the use of IRES-based technology, it is preferred that a distinct promoter be used to 

35 support each respective open reading frame, so as to best preserve vector stability. As 
examples, and certainly not as limitations, potential multiple transgene vaccines may 

31 



wo 02^2080 PCT/USOl/28861 



include a three transgene vector such as hCMV-gagpol-bGHpA + mCMV-nef-SPA in 
an E3 deleted backbone or hCMV-gagpol-bGHpA + mCMV-nef-SPA(E3+). 
Potential "2+1" divalent vaccines of the present invention might be a hCMV-gag- 
bGHpA + mCMV-nef-SPA in an E3+ backbone (vector #1) in combination with 

5 hCMV-pol-bGHpA in an E3+ backbone (vector #2). with all transgenes in the El 
parallel orientation. Fusion constructs other than the gag-pol fusion described above 
are also suitable for use in various divalent vaccine strategies and can be composed of 
any two HTV antigens fused to one another (e.g.„ nef-pol and gag-nef). These 
adenoviral compositions are, as above, preferably deUvered along with an adenoviral 

10 composition comprising an additional HIV antigen in order to diversify the immune 
response generated upon administration. Therefore, a multivalent vaccine delivered 
in a single, or possible second, adenoviral vector is certainly contemplated as part of 
the present invention. Again, this mode of administration is another example of 
whereby an efficaceous adenovirus-based HIV-1 vaccine may be administered via a 

15 combined modality regime. It is important to note, however, that in terms of deciding 
on an insert for the disclosed adenoviral vectors, due consideration must be dedicated 
- to the effective packaging limitations of the adenovirus vehicle. Adenovirus has been 
shovm to exhibit an upper cloning capacity limit of approximately 105% of the 

wildtype Ad5 sequence. 

20 Regardless of the gene chosen for expression, it is preferred that the sequence 

be "optimized" for expression in a human ceUular environment A "triplet" codon of 
four possible nucleotide bases can exist in 64 variant forms. That these forms provide 
the message for only 20 different amino adds (as well as transcription initiation and 
termination) means that some amino acids can be coded for by more than one codon. 

25 Indeed, some amino acids have as many as six "redundant", alternative codons while 
some others have a single, required codon. For reasons not completely understood, 
alternative codons are not at all uniformly present in the endogenous DNA of 
differing types of ceUs and there appears to exist variable natural hierarchy or 
"preference" for certain codons in certain types of ceUs. As one example, the amino 

30 acid leucine is specified by any of six DNA codons including CTA, CTC, CTG, CTT, 
TTA, and TTG (which correspond, respectively, to the mRNA codons, CUA, CUC, 
CUG, CUU, UUA and UUG). Exhaustive analysis of genome codon frequencies for 
microorganisms has revealed endogenous DNA ofE. coli most commonly contains 
the CTG leudne-specifying codon, while the DNA of yeasts and slime molds most 

35 commonly includes a TTA leudne-spedfying codon. In view of this hierardiy, it is 
generally held that the likelihood of obtainmg high levels of expression of a leucine- 
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rich polypeptide by an E. coli host will depend to some extent on the frequency of 
codon use. For example, a gene rich in TTA codons will in all probability be poorly 
expressed in E. colU whereas a CTG rich gene will probably highly express the 
polypeptide. Similarly, when yeast ceDs are the projected transformation host cells 
5 for expression of a leucine-rich polypeptide, a preferred codon for use in an inserted 
DNA would be TTA. 

The implications of codon preference phenomena on recombinant DNA 
techniques are manifest, and the phenomenon may serve to explain many prior 
failures to achieve high expression levels of exogenous genes in successfiilly 

10 transformed host organisms-a less "preferred" codon may be repeatedly present in the 
inserted gene and the host cell machinery for expression may not operate as 
efficiently. This phenomenon suggests Aat synthetic genes which have been designed 
to include a projected host cell's preferred codons provide a preferred form of foreign 
genetic material for practice of recombinant DNA techniques. Thus, one aspect of 

15 this invention is an adenovirus vector or adenovirus vector in some combination with 
a vaccme plasmid where both specifically include a gene which is codon optimized 
for expression in a human cellular environment As noted herein, a preferred gene 
for use in the instant invention is a codon-optimized HIV gene and, particularly, HIV 
gag, pol or nef . 

20 Adenoviral vectors in accordance with the instant invention can be constructed 

using known techniques, such as those reviewed in Hitt et al, 1997 "Human 
Adenovirus Vectors for Gene Transfer into Mammalian Cells" Advances in 
PJiarmacology 40: 137-206, which is hereby incorporated by reference. 

In constructing the adenoviral vectors of this invention, it is often convenient 

25 to insert them into a plasmid or shutde vector. These techniques are known and 
described in Hitt et al., supra. This invention specifically includes both the 
adenovims and the adenovirus when inserted into a shuttle plasnud. 

Preferred shuttle vectors contain an adenoviral portion and a plasmid portion. 
The adenoviral portion is essentially the same as the adenovirus vector discussed 

30 supra, containing adenoviral sequences (with non-functional or deleted El and E3 
regions) and the gene expression cassette, flanked by convenirat restriction sites. The 
plasmid portion of the shutde vector often contains an antibiotic resistance marker 
under transcriptional control of a prokaryotic promoter so that expression of die 
antibiotic does not occur in eukaryotic cells. Ampicillin resistance genes, neomycin 

35 resistance genes and other pharmaceutically acceptable antibiotic resistance markers 
may be used. To aid in the high level production of the polynucleotide by 
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fermentation in prokaryotic OTganisms, it is advantageous for the shuttle vector to 
contain a prokaryotic origin of replication and be of high copy number. A number of 
conunercially available prokaryotic cloning vectors provide these benefits. It is 
desirable to remove non-«ssential DNA sequences. It is also desirable that the vectors 

5 not be able to lepUcate in eukaiyotic cells. This minimizes the risk of integration of 
polynucleotide vaccine sequences into the recipients' genome. Tissue-specific 
promoters or enhances may be used whenevCT it is desirable to limit expression of the 
polynucleotide to a particular tissue type. 

In one embodiment of this invention, the pre-plasmids (e.g., pMRKAdSpol, 

10 pMRKAdSnef and pMRKAdSgag were generated by homologous recombination 
using the MRKHVE3 (and MRKHVO for the E3- version) backbones and the 
appropriate shuttle vector, as shown for pMRKAdSpol in Figure 22 and for 
pMRKAdSnef in Figure 23. The plasmid in linear form is enable of replication after 
entering the PER.C6® cells and virus is produced. The infected cells and media were 

15 harvested after viral replication was complete. 

Viral vectors can be propagated in various El complementing cell lines, 
including the known cell lines 293 and PER.C6®. Both these ceU Unes express the 
adenoviral El gene product. PER.C6® is described in WO 97/00326 (pubUshed 
January 3, 1997) and issued U.S. Patent No. 6,033,908, both of which are hereby 

20 incorporated by reference. It is a primary human retinoblast cell Une transduced with 
an El gene segment that complements the production of replication deficient (FG) 
adenovirus, but is designed to prevent generation of replication competent adaiovirus 
by homologous recombination. Cells of particular interest have been stably 
transformed with a transgene that encodes the AD5E1A andElB gene, lite PER.C6®, 

25 from 459 bp to 3510 bp inclusive. 293 cells are described in Graham et al., 1977 J. 
Gen. Virol 36:59-72, which is hereby incorporated by reference. As stated above, 
consideration must be given to the adenoviral sequences present in the 
complementing ceU line used. It is important that the sequences not overlap with that 
present in the vector if the possibiUty of recombination is to be minimized. 

30 It has been found that vectors generated in accordance with the above 

description are more effective in inducing an inomune response and, thus, constitute 
very promising vaccine candidates. More particularly, it has been found that first 
generation adenoviral vectors in accordance with the above description carrying a 
codon-optimized HIV gag gene, regulated with a strong heterologous promoter can be 

35 used as human anti-HTV vaccines, and arc capable of mducing immune responses. 
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Standard techniques of molecular biology for preparing and purifying DNA 
constructs enable the preparation of the DNA immunogens of this invention. 

A vaccine composition comprising an adenoviral vector in accordance with 
the instant invention may contain physiologically acceptable components, such as 
5 buffer, normal saline or phosphate buffered saline, sucrose, other salts and 

polysorbate. One preferred formulation has: 2.5-10 mM TRIS buffer, preferably 
about 5 mM TRIS buffer; 25-100 mM NaCl, preferably about 75 mM NaQ; 2.5-10% 
sucrose, preferably about 5% sucrose; 0.01 -2 mMMgCh; and 0.001%-0.01% 
polysorbate 80 (plant derived). The pH should range from about 7.0-9.0, preferably 

10 about 8.0. One skUled in the art will appreciate that other conventional vaccine 
excipients may also be used it make the formulation. The preferred formulation 
contains 5mM TRB, 75 mM NaCl, 5% sucrose, ImM MgCh, 0.005% polysorbate 80 
at pH 8.0 This has a pH and divalent cation composition which is near the optimum 
for Ad5 stability and minimizes the potential for adsorption of virus to a glass surface. 

15 It does not cause tissue irritation upon intramuscular injection. It is preferably frozen 
until use. 

The amount of adenoviral particles in the vaccine composition to be 
introduced into a vaccine recipient wiU depend on the strength of the transcriptional 
and translational promoters used and on the immunogenicity of the expressed gene 

20 product. In general, an immunologically or prophylactically effective dose of 1x10 
to 1x10^^ particles and preferably about 1x10^® to 1x10^^ particles is administered 
directly into muscle tissue. Subcutaneous injection, intradermal introduction, 
impression through the skin, and other modes of administration such as 
intraperitoneal, intravenous, or inhalation dehvery are also contemplated. It is also 

25 contemplated that booster vaccinations are to be provided. Follovving vaccination 
with HEV adenoviral vector, boosting with a subsequent HIV adenoviral vector and/or 
plasmid may be desirable. Parenteral administration, such as intravenous, 
intramuscular, subcutaneous or other means of administration of interleukin-12 
protein, concurrendy with or subsequent to parenteral introduction of the vaccine 

30 compositions of this invention is also advantageous. 

The adenoviral vector and/or vaccine plasmids of this invention 
polynucleotide may be unassociated with any proteins, adjuvants or other agents 
which impact on the recipients' iimnune system. In this case, it is desirable for the 
vector to be in a physiologically acceptable solution, such as, but not limited to, sterile 

35 saline or sterile buffered saline. Alternatively, the vector may be associated with an 
adjuvant known in the art to boost immune responses (i.e., a "biologically eflFective" 
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adjuvant), such as a protein or other carrier. Vaccine plasnrids of this invention may, 
for instance, be deUveied in saline (e.g., PBS) with or without an adjuvant Prefened 
adjuvants aie Alum or CRL1005 Block Copolymer. Agents which assist in the 
cellular uptake of DNA, such as, but not limited to, calcium ions, may also be used to 
5 advantage. Hiese agents are genesraUy referred to herein as transfection facilitating 
reagents and pharmaceutically acceptable carriers. Techniques for coating 
microprojectiles coated with polynucleotide are known in the art and are also useful in 

connection with this invention. 

This invention also includes a prime and boost regimen v^erein a first 

10 adenoviral vector is administered, then a booster dose is given. The booster dose may 
be repeated at selected time intervals. Altematively, a preferred inoculation scheme 
comprises priming with a first adenovirus serotype and then boosting with a second 
adraiovirus serotype. More preferably, the inoculation scheme comprises priming 
with a first adenovirus serotype and then boosting wifli a second adenovirus serotype, 

1 5 wherein the first and second adraiovirus serotypes are classified within separate 

subgroups of adenoviruses. The above prime/boost schemes are particularly preferred 
in those situations where a preexisting immunity-is identified to the adenoviral vector 
of choice. In this type of scheme, the individual or population of individuals is 
primed with an adenovirus of a serotype other than that to which the preexisting 

20 immunity is idaitified. This enables the first adenovirus to ^ectuate sufficient 

expression of the transgene while evading existing immunity to the second adenovims 
(the boosting adenovirus) and. fiirther, allows for the subsequent delivery of the 
transgene via the boosting adenovirus to be more effective. Adenovirus serotype 5 is 
one example of a virus to which such a scheme might be desuable. In accordance 

25 with this invention, fliercfore, one might decide to prime with a non-group C 

adenovirus (e.g., Adl2, a group A adenovirus, Ad24, a group D adenovirus, or Ad35, 
a group B adenovirus) to evade anti-Ad5 unmunity and then boost with Ad5, a group 
C adenovirus. Another preferred embodiment involves administration of a different 
adenovirus (including non-human adenovirus) vaccine foUowed by adnjinistratipn of 

30 the adenoviral vaccines disclosed. In the alternative, a viral antigen of interest can be 
first delivered via a viral vaccine other than an adenovirus-based vaccine, and then 
foUowed with the adenoviral vaccine disclosed. Alternative viral vaccines include but 
are not limited to pox virus and Venezuelan equine encqjhilitis virus. 

A large body of human and animal data supports the importance of cellular 

35 immune responses, especially CTL in controlling (or eliminating) HIV infection. In 
humans, very high levels of CTL develop foUowing primary infection and correlate 
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with the control of viiemia. Several small groups of individuals have been described 
who are repeatedly exposed to HIV by remain uninfected; CTL has been noted in 
several of these cohorts. In the SIV model of HIV infection, CTL similarly develops 
following primary infection, and it has been demonstrated that addition of anti-CD8 
5 monoclonal antibody abrogated this control of infection and leads to disease 

progression. This invention uses adenoviral vaccines alone or in conabination with 
plasmid vaccines to induce CTL. 

The following non-limiting Examples are presented to better illustrate the 

invention. 

10 

EXAMPLE 1 

Removal of the Intron A Portion of the hCMV Promoter 
GMP grade pVUnsHIVgag was used as the starting material to amplify the 
hCMV promoter. PVUnsHIVgag is a plasmid comprising the CMV immediate-early 

15 (IE) promoter and intron A, a full-length codon-optimized HTV gag gene, a bovine 
growth hormone-derived polyadenylation and transcriptional termination sequence, 
and a minimal pUC backbone; see Montgomery et a/., supra for a description of the 
plasmid backbone. The amplification was performed with primers suitably positioned 
to flank the hCMV promoter. A 5' primer was placed upstream of the Mscl site of 

20 the hCMV promoter and a 3* primer (designed to contain the Bgfll recognition 

sequence) was placed 3' of the hCMV promoter. The resulting PGR product (using 
high fidelity Taq polymerase) which encompassed the entire hCMV promoter (minus 
intron A) was cloned into TOPO PGR blunt vector and then removed by double 
digestion with Mscl and Bg/II. This fragment was then cloned back into the origmal 

25 GMP grade pV IJnsHIVgag plasmid from which the original promoter, intron A, and 
die gag gene were removed following Mscl and BglH digestion. This ligation 
reaction resulted in the construction of a hCMV promoter (minus intron A) + bGHpA 
expression cassette within the original pVUnsHIVgag vector backbone. This vector 
is designated pVIJnsCMV(no intron). 

30 The FLgag gene was excised from pVUnsHIVgag using BgZH digestion and 

the 1,526 bp gene was gel purified and cloned into pVUnsCMV(no intron) at the 
< BgHL site. Colonies were screened using Siml restriction enzymes to identify clones 
that carried the Hgag gene in the correct orientation. This plasmid, designated 
pVlJnsCMV(no intron)-FLgag-bGHpA, was fully sequenced to confirm sequence 

35 integrity. 
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Two additional transgenes were also constructed. The plasmid, 
pVl JnsCMV(no mtron)-ELgag-SPA, is identical to pVlJnsCMV(no intron)-FLgag- 
bGK^A except that the bovine growth hormone polyadenylation signal has been 
replaced with a short synttietic polyA signal (SPA) of 50 nucleotides in length. The 
5 sequence of the SPA is as shown, with the essential components (poly(A) site, (GT)n, 
and (T)n; respectively) underlined: 

A ATA AAA GAIUnTATrrreATTAGATCTrrTGTG TTGGTmTTGTGTG 

(SEQ ID NO:18). 

The plasmid, pVlJns-mCMV-FLgag-bGHpA, is identical to the 
10 pVl JnsCMV(no intron)-FLgag-bGHpA except that the hCMV promoter has been 
removed and replaced with the murine CMV (mCMV) promota:. 

Figure 3 diagrammatically shows the new transgene constructs in comparison 
with the original transg^e. 



15 EXAMPLE 2 

Gag Expression Assay for Modified Gag Transgenes 
_ Gag Elisa was paformed on culture supematants obtained from transiait - 

tissue culture transfection experiments in which the two new hCMV-containing 
plasmid constructs, pVlJnsCMV(no intron)-FLgag-bGHpA and pVlJnsCMV(no 

20 intron)-FLgag-SPA, both devoid of intron A, were compared to pVl JnsHIVgag 

which, as noted above possesses the intron A as part of the hCMV promoter. Table 2 
below shows the in vitro gag expression data of the new gag plasmids compared with 
the GMP grade original plasmid. The results displayed in Table 2 show that both of 
the new hCMV gag plasmid constructs have expression capacities comparable to the 

25 original plasmid construct which contains the intron A portion of the hCMV 
promoter. 
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Table 2: In vitro DNA transfection of original and new plasmid HTV-l ^ag constructs. 



Plasmid 


Ug gag/10e6 COS cells/5ug DNA/48 hr 


HIVEL-gagPR9901' 


10.8 


PVnns-hCMV-FLgag-bGHpA" 


16.6 


pVlJns-hCMV-FLgag-SPA'^ 


12.0 



^ GMP grade pVl Jns-hCMVintionA-ELgag-bGHpA. 
5 ^ New plasmid constructions that have the intron A portion removed from the hCMV 
promoter. 

^ In this construct the bGH terminator has been replaced with the short synthetic 
polyadenylation signal (SPA) 



10 EXAMPLE 3 

Rodent (Balb/c) Study for Modified gag Transgenes 
A rodent study was performed on the two new plasmid constructs 
described above - pVlJnsCMV(no intron)-FLgag-bGHpA and pVlJnsCMV(no 
intron)-FLgag-SPA - in order to compare them with the construct described above 
15 possessing the intron A portion of the CMV promoter, pVlJnsHIVgag. Gag antibody 
and Elispot responses (described in PCT International Application No. 
PCT/USOO/18332 (WO 01/02607) filed July 3, 2000, claiming priority to U.S. 
Provisional Application Serial No. 60/142,63 1 , filed July 6, 1999 and U.S. 
Application Serial No. 60/148,981, filed August 13, 1999, all three applications which 
20 are hereby incorporated by reference) were measured. The results displayed in Table 
3 below, show that the new plasmid constructs behaved equivalently to the original 
. construct in Balb/c mice with respect to their antibody and T-cell responses at both 
dosages of plasmid DNA tested, 20 \ig and 200 \ig. 
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EXAMPLE 4 



Table 3: HIV1 91 : Immunogenicity of VlJns-gag under different promoter and termination 
elements. 



DNA" 


Dose, 
ug'* 


Anti-p24 Titers 
fSWkPDI)'' 


SFC/10''6 Cells 
(4 Wk PD1)** 


Promoter/terminator 


GMT 1 


+SE 




Media 


gag197-205 


p24 


HIVFL-gagPR9901 
(GMP grade) 


200 
20 


12800 
5572 


4652 
1574 


3412 
1227 


2(2) 
0 


129(19) 
56(9) 


30(11) 
25(6) 


pVUns-hCMV- 
FL-gag-bGHpA 


200 
20 


11143 
7352 


2831 
2808 


2257 
2032 


0 
0 


98(5) 
73(9) 


12(6) 
11(6) 


pVUns-hCMV- 
FL-gag-SPA 


200 
20 


16890 
5971 


5815 
5361 


4326 
2825 


1(1) 
0 


94(4) 
85(17) 


26(7) 
38(10) 


Narve 


0 


123 


50 


36 


0 


0 


0 



'in PBS 

\m. Injections into both quads, 50 jiL per quad 
'n=10;GMT, geometiic mean liter; SE, standard, enwr 

"n.S pooled spleens; mean of tripBcate wells and standard, devlalion. m parentheses; 



Construction oftoe MoMed Shutde Vector t^^mKpde^^ 
The modifications to the original Ad5 shuttle vector (pdelElspl A; a vector 
comprising Ad5 sequences ftom basepairs 1-341 and 3524-5798. with a multiple 

5 . cloning region between nucleotides 341 and 3524 of Ad5, included the following 
three manipulations carried out in sequential cloning steps as follows: 
(1) Hie left rm region was extended to include the Pacl site at the junction between 
the vector backbone and the adenovirus left ITR sequences. This aUow for easier 
manipulations using the bacterial homologous recombination system. 

10 (2) The packaging region was extended to include sequences of the wild-type (WT) 
adenovirus from 342 bp to 450 bp inclusive. 

(3) The area downstream of pIX was extended 13 nucleotides (i.e., nucleotides 3511- 
3523 inclusive). 

These modifications (Figure 4) effectively reduced the size of the El deletion witiiout 
15 overlapping with any part of the EIA/EIB gene present in tiie transformed PER.C6 
cell Une. All manipulations were performed by modifying the Ad shutde vector 
pdeffilsplA. 

Once the modifications were made to ttie shuttle vector, the changes were 
incorporated into tiie original Ad5 adenovector backbones (pAdHVO andpAdHVE3) 
20 by bacterial homologous recombination using E. coli BJ5183 chemically competent 
cells. 
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EXAMPLES 

Construction of Modified Adcnovector Backbones fB3+ and E3-) 
The original adeno vectors pAdHVO (comprising all Ad5 sequences except 
those nucleotides encompassing the El and £3 regions ) and pADHVES (comprising 
5 all Ad5 sequences except those nucleotides encompassing the El region), were each 
reconstmcted so that they contained the modifications to the El region. This was 
accomplished by digesting the newly modified shutde vector (MRKpdelEl shuttle) 
with Pacl and BstZl 101 and isolating the 2,734 bp fragment which corresponds to the 
adenovirus sequence. This fragment was co-transformed with DNA from either CZal 

10 linearized pAdHVO (E3- adenovector) or Clal linearized pAdHVE3 

(E3+adenovector) into E. coli BJ5183 competent cells. At least two colonies from 
each transformation were selected and grown in Terrific™ broth for 6-8 hours until 
turbidity was reached. DNA was extracted from each cell pellet and then transformed 
into E. coli XLl competent cells. One colony from each transformation was selected 

15 and grown for plasmid DNA purification. The plasmid was analyzed by restriction 
digestions to identify correct clones. The modified adenovectors were designated 
MRKpAdHVO (E3- plasmid) and MRKpAdHVE3 (E3+ plasmid). Virus from these 
new adenovectors (MRKHVO and MRKHVE3, respectively) as well as the old 
version of the adenovectors were generated in the PER.C6® cell lines to accommodate 

20 the following series of viral competition experiments. In addition, the multiple 
cloning site of the original shuttle vector contained Clal , BamHI, Xho I, EcoRV, 
Hindin, Sal I, and Bgl U sites. This MCS was replaced with a new MCS containing 
Not I, Cla I, EcoRV and Asc I sites. This new MCS has been transferred to the 
MRKpAdHVO and MRKpAdHVE3 pre-plasmids along with the modification made 

25 to the packaging region and pIX gene. 

EXAMPLE 6 

Analvsis of the Effect of the Packapuig Sipial Extension 
To study the effects of the modifications made to the El deletion region, the 
30 viruses obtained from the original backbone (pAdHVE3) and the new backbone 
(MRKpAdHVE3) were mixed together in equal MOI ratios (1:1 and 5:5) and 
passaged through several rounds; see Figure 5, Expt.#l. Both of the viruses in the 
experiment contained the E3 gene intact and did not contain a transgene. The only 
difference between the two viruses was within the region of the El deletion. 
35 Following the coinfection of the viruses at PI (passage 1), the mixtures were 

propagated through an additional 4 passages at which time the cells were harvested 
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and the virus extracted and purified by CsCl banding. The viral DNA was extracted 
and digested with ffimffll and the digestion products were then radioactively labeled. 
For the controls, the respective pre-plasmids (pAdHVES ("OLD E3+"); 
MRKpAdHVES ("NEW E3+")) were also digested with HindSa (and Pacl to remove 

5 the vector backbone) and subsequenfly labeled with [^¥]dATP. The radioactively 
labeled digestion products were subjected to gel electirophoresis and the gel was dried 
down onto Whatman paper before being exposed to autoradiographic film. Hgure 6 
clearly shows that the new adenovirus which has die addition made to the packa^g 
signal region has a growth advantage compared with the original adenovirus. In fee 

10 experiments performed (at either ratio tested), only the digestion bands pertaining to 
the newly modified virus were present The diagnostic band of size 3,206 (firom the 
new virus) was clearly present. However, fliere was no evidence of the diagnostic 
band of size 2,737 bp expected from the original virus. 

15 EXAMPLE 7 

Anal ysis nf the Effect o f the E3 Gene 
The second set of the virus competition stody involved mixing equal MOI 
ratio (1 : 1) of the newly modified viruses, that obtained from MilKpAdHV O and 
MRKpAdHVES (Rguie 5, Expt. #2). In this set, both viruses had die new 

20 modifications made to the El deletion. The first virus (that fiom MRKpAdHVO) 
does not contain an E3 gene. The second virus (that from MRKpAdHVE3) does 
contain the E3 gene. Neither of the viruses contain a toansgene. Following co- 
infection of the viruses, the mixtures were propagated through an additional 4 
passages at which time the cells were harvested and the total virus extracted and 

25 purified by CsCl banding. The viral DNA was extracted and digested with Hindm. 
and die digestion products were then radioactively labeled. For die contix>ls, the 
respective pre-plasmids MRKpAdHVO ("NEW E3-"); MRKpAdHVE3 ("NEW 
E3+") were also digested with Hindm (and Foci to remove the vector backbone) and 
then labeled widi [33p]dATP. The radioactively labeled digestion products were 

30 subjected to gel electrophoresis and the gel was dried down onto Whatman paper 

before being exposed to autoradiographic fihn. Figure 6 shows the results of tii& viral 
DNA analysis of die E3+ virus and E3- virus mixing experiment. The diagnostic 
band corresponding to the E3+ virus (5,665 bp) was present in greater amount 
compared with die diagnostic band of 3,010 bp corresponding to die E3- virus. This 

35 indicates tiiat die virus diat contains the E3 gene is able to amplify more rapidly 
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compared with the virus that does not contain an E3 gene. This increased 
amplification capacity has been confirmed by growth studies; see Table 4 below. 

EXAMPLES 

Construction of the new shutde vector containing modifieri g a g transgene - 

'*MRKpdelEl-CMVfno intronVFLgag-bGHpA" 
The modified plasmid pVl JnsCMV(no intron)-ELgag"bGHpA was digested 
with Mscl overnight and then digested with ,^1 for 2 hours at 50°C. The DNA was 
then treated with Mungbean nuclease for 30 mins at 30**C. The DNA mixture was 
desalted using the Qiaex II kit and then Klenow treated for 30 mins at ST'C to fully 
blunt the ends of the transgene firagment. The 2,559 bp transgene fragment was then 
gel purified. The modified shutfle vector (MRKpdelEl shuttle) was linearized by 
digestion with EcoRV, treated with calf intestinal phosphatase and the resulting 6,479 
bp fragment was then gel purified. The two purified fragments were then ligated 
together and several dozen clones were screened to check for insertion of the 
transgene within the shutde vector. Diagnostic restriction digestion was performed to 
identify those clones carrying the transgene in the El parallel and El anti-parallel 
orientation. This strategy was followed to clone in the other gag transgenes in the 
MRKpdelEl shuttle vector. 

EXAMPLB9 
Construction of the MRK FG Adenovectors 
The shuttle vector containing the HTV-l gag transgene in the El parallel 
orientation, MRKpdelEl -CMV(no intron)-FLgag-bGHpA, was digested with PacL 
The reaction mixture was digested with BsfZlll. The 5,291 bp fragment was purified 
by gel extraction. 'The MRKpAdHVE3 plasmid was digested with Clal overnight at 
3TC and gel purified. About 100 ng of the 5,290 bp shuttle +transgene fragment and 
-100 ng of linearized MRKpAdHVE3 DNA were co-transformed into K coli BJ5183 
chemically competent cells. Several clones were selected and grovm in 2 ml 
Terrific™ broth for 6-8 hours, until turbidity was reached. The total DNA from the 
cell pellet was purified using Qiagen alkaline lysis and phenol chloroform method. 
The DNA was precipitated with isopropanol and resuspended in 20 (jQ dH20. A 2 jil 
aliquot of this DNA was transformed into E. coli XL-1 competent cells. A single 
colony from each separate transformation was selected and grown overnight in 3 ml 
LB +100 ^g/ml ampicillin. The DNA was isolated using Qiagen columns. A positive 
clone was identified by digestion with the restriction enzyme BstEH which cleaves 
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within the gag gene as well as the plasmid backbone. The pre-plasmid clone is 
designated MBlKpAdHVE3+CMV(no intron)-ELgag-bGHpA and is 37,498 bp in size. 
This strategy was foDowed to generate E3- and E3+ versions of each of the otfier gag 
transgene constructions in both El parallel and El anti-parallel versions. Figures 7A, 
5 7B and 7C show the various combinations of adenovectors constructed. 

EXAMPLE 10 
Plasmid Competition Studies 
A series of plasmid competition studies was carried out Briefly, the screening 

10 of the various combinations of new constructs was performed by mixing equal 

amounts of each of two competing plasmids. In the experiment shown in Kgure 8A, 
plasmids containing the same transgene but in different orientations were mixed 
together to create a "competition" between the two plasmids. The aim was to look at 
the effects of transgene orientation. In the experiment shown in Kgure 8B, plasmids 

15 containing different polyadenylation signals G)ut in the same orientation) were mixed 
together in equal amounts. The aim was to assess effects of polyA signals. Following 
the initial transfection, the-vims was passaged throu^ ten rounds and the viral DNA 
analyzed by radioactive restriction analysis. 

Analysis of die viral species from the plasmid mixing experiment (Figure 8A) 

20 showed that adenovectors which had the transgene inserted in the El parallel 

orientation amplified better and were able to out-compete the adenovirus which had 
the transgene inserted in the El anti-parallel orientation. Viral DNA analysis of the 
mixtures at passage 3 and certainly at passage 6, showed a greater ratio of the virus 
carrying the transgene in the El parallel orientation compared witii the El antiparallel 

25 version. By passage 10, the only viral species observed was the adenovector with the 
transgene in the El parallel orientation for both transgenes tested (hCMV(no intron)- 
FLgag-bGHpA and hCMV(no intron>-FLgag-SPA). 

Analysis of the viral species from the plasmid mixing experiment #2 (Figure 
8B) at passages 3 and 6 showed that the polyadenylation signals tested (bGHpA and 

30 SPA) did not have an effect on the growth of die virus. Even at passage 10 the two 
viral species in the mixture were still present in equal amounts. 
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EXAMPLE 11 

Vims generation of an enhanced adenoviral construct - "MRK Ad5 HIV-lgag" 
The results obtained from the competition study allowed us to make the 
following conclusions: (1) The packaging signal extension is beneficial; (2) Presence 
5 of E3 does enhance viral growth; (3) El parallel orientation is recommended; and (4) 
PolyA signals have no effect on the growth of the adenovirus. 

MRK Ad5 HIV-1 gag exhibited the most desirable results. This construct 
contains the hCMV(no intron)-FLgag-bGHpA transgene inserted into the new E3+ 
adenovector backbone, MRKpAdHVE3, in the El parallel orientation. We have 
10 designated this adenovector MRK Ad5 HIV-1 gag. This construct was prepared as 
outlined below: 

The pre-plasmid MllKpAdHVE3+CMV(no intron)-FLgag-bGHpA was 
digested was Pacl to release the vector backbone and 3.3 [ig was transfected by 
calcium phosphate method (Amersham Pharmacia Biotech.) in a 6 cm dish containing 

15 PER.C6® cells at -60% confluence. Once CPE was reached (7-10 days), the culture 
was freeze/thawed three times and the cell debris pelleted. 1 ml of this cell lysate was 
used to infect into a 6 cm dish containing PER.C6® cells at 80-90% confluence. Once 
CPE was reached, the culture was freeze/thawed three times and the cell debris 
pelleted. The cell lysate was then used to infect a 15 cm dish containing PER.C6® 

20 cells at 80-90% confluence. This infection procedure was continued and expanded at 
passage 6. The virus was then extracted from the cell pellet by CsCl method. Two 
bandings were performed (3-gradient CsCl followed by a continuous CsCl gradient). 
Following the second banding, the virus was dialyzed in A105 buffer. Viral DNA 
was extracted using pronase treatment followed by phenol chlorofonn. The viral 

25 DNA was then digested with HindHI and radioactively labeled with [33p]dATP. 
Following gel electrophoresis to separate the digestion {HX>ducts the gel was dried 
down on Whatman paper and then subjected to autoradiography. The digestion 
products were compared with the digestion products from the pre-plasmid (that had 
been digested with Pacl/HindUl prior to labeling). The expected sizes were 

30 observed, indicating that the virus had been successfully rescued. This strategy was 
used to rescue virus from each of tiie various adenovector plasmid constructs 
prepared. 
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EXAMPLE 12 
Stability Analyses 

To determine whether the various adenovector constructs (e.g., MRK Ad5 
mV-l gag) show genetic stability, the viruses were each passaged continually. The 
5 viral DNA was analyzed at passages 3, 6 and 10. Each virus maintained its correct 
genetic structure. In addition, the stability of the MRK Ad5 HTV-l gag was analyzed 
under propagation conditions similar to that performed in large scale production. For 
this analysis, the transfections of MRK Ad5 HTV-l gag as well as three other 
adenoviral vectors were repeated and the virus was purified at P3. The three other 

10 adenovectors were as follows: (1) that comprising hCMV(no intron)-Rgag with a 
bGHpA terminator in an E3- adenovector backbone; (2) that comprising hCMV(no 
intron)-Hgag with a SPA termination signal in an E3+ adenovector backbone, and 
that comprising a mCMV-Hgag with a bGHpA terminator in an E3+ adenovector 
backbone. All of the vectors have the transgene inserted in the El parallel orientation. 

15 Viral DNA was analyzed by radioactive restriction analysis to confirm that it was 
correct before being delivered to fermentation cell culture for continued passaging in 
seram-fiee media. At P5 each of the four vuiises were purified and the viral DNA 
extracted for analysis by the restriction digestion and radiolabeling procedure. This 
virus has subsequendy been used in a series of studies {in vitro gag expression in COS 

20 cells, rodent study and rhesus monkey study) as will be described below. The viruses 
jfrom P5 are shown in Figure 9. 

The passaging under serum-free conditions was continued for the 
MRKHVE3 (transgene-less, obtamed from MRKpAdHVE3 pre-plasmid) and the 
MRKAdSHIV-lgag (obtained from MRKpAdHVE3+CMV(no intron)-ELgag- 

25 bGHpA pre-plasmid) viruses. Figure 10 shows viral DNA analysis by radioactive 
restriction digestion at passage 11 for MRKHVE3, MRKAd5HIV-lgagE3-, and 
passage 11 and 12 for MRKAd5HIV-lgag. Aside fi-om the first lane which is the 
DNA marker lane, the next three lanes are virus from the pre-plasmid controls 
(controls based on the original virus) - MRKpAdHVE3 (also referred to as 

30 "pMRKHVES"), MRKpAdHVE3+CMV(no intron)-FLgag-bGHpA, and 

pMRKAd5gag(E3-). respectively. As seen in Figure 10, each of the viral DNA 
samples show the expected bands with no extraneous bands showing. This signifies 
that there are no major variant adenovirus species present that can be detected by 
autoradiography. 

35 Figure 1 1 shows the results of viral competition study between MRKHVE3 

and MRKAd5HIV-l gag. These viruses were mixed together at equal MOI (140 viral 
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particles each; 280 vp total) at passage 6 and continued to be passaged until PH. 
Aside from the first lane which is the DNA marker lane, the next two lanes are the 
pie-plasmid controls obtained from MRKpAdHVES and MRKpAdHVE3+CMV(no 
intron)-ELgag-bGHpA. The next two lanes are the viral DNA from the starting viral 
5 material at passage six. The last two lanes are the competition studies performed in 
duplicate. The data in Figure 11 shows the effect the gag transgene in culture. 
Growth of a MRKAdSgag virus was compared with growth of a "transgene-less" 
MRKHVE3. These two vmxses were infected at the same MOI (i.e. 140 vp each) at 
passage 6 and then passaged through to passage 11 and the viral pool was analyzed by 
10 radioactive restriction analysis. The data shows that one virus did not out compete the 
other. Therefore, the gag transgene did not show obvious signs of toxicity to the 
adenovirus. 

Analysis by HindSn digestion shows that each virus specie is present in . 
approximately equal amounts. As above, there does not appear to be signs of any 
15 extraneous bands. Figure 12 shows higher passage numbers for MRKAdSHTV-lgag 
grown under serum-containing conditions. Hie genome integrity again has been 
maintained and there is no evidence of rearrangements, even at the highest passage 
level (P21). 

Each of the four vectors shown in Figure 9 were analyzed for amplification 
20 capacity. Table 4 below shows the QPA analysis used in the estimation of viral 

amplification ratios at F4. The determination of the amplification ratio for the 

original HIV-1 gag construct is based on the clinical lot at P12. It has been shown 

that amplification rates increases with higher passage number for the original virus. 

The reason for this observation is due to the emergence of variants which exhibit 
25 increased growth rates compared to the intact adenovecton With continued passaging 

of the original Ad gag vector, the level of variants increases and hence amplification 

rates increase also. 

The MRK Ad5 HIV- 1 gag virus has also been continually passaged xmder 

process conditions (i.e., serum-free media). Viral DNA extracted from passages 11 
30 and 12 show no evidence of rearrangement 
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Table 4: 

Amplification Ratios Based on AEX and QPA Analysis of 
Virus Amplification from Passage 3 to Passage 4. 



Ad gag construct 


Amplification Ratio 


MRKAdSgag 


470 


HCMV-Hp[ap-bGHDA fES-l 


115 


HCMV-Fl^ap-SPA (E3+1 


320 


mCMV-ELpap-bGHpA fE3+] 


420 


Original construct * 


40-50 



5 

* This estimation is based on tiie clinical lot growtii characteristics at Passage 12. 



EXAMPLE 13 

10 Analytical Evaluation of the enhanced Ad5 Constructs 

To study the eflEects of the transgene and the E3 gene on virus anq>lification, 
the enhanced adenoviral vector, MRK Ad5 HIV-1 gag, along with its transgene-less 
version (MKJKpAdHVE3) and its E3- version (MR&AdS HIV-1 gag E3-), was 

studied for several passages under serum-free conditions. Table 5A shows the 
15 amplification ratios determined for passages P3 to P8 for MRK Ad5 fflV-1 gag. 
Within a certain MOI range, it has been determined that the virus output is directly 
proportional to the virus input Therefore, the greater the number of virus particles 
per ceU at infection, the greater the virus amount produced. Viral ampUfication ratios, 
on the other hand, are invrasely proportional to the virus input. The lower tiie vims 
20 input, the greater the amplification ratio. 

Table 5B shows the amplification rates of the new E3+ vector backbone 
MRKpAdHVE3. It has a significanfly lower rate of amplification compared with tiie 
gag ttansgene containing version. IMs may be contributed to the larger size MRK 
Ad5 HIV-1 gag since it contains tiie transgene. This inclusion of the transgene brings 
25 tiie size of tiie adenovirus closer to tiie size of a wild type Ad5 virus. It is weU known 
fliat adenoviruses amplify best when tiiey are at close to their wild type genomic size. 
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Wild type Ad5 is 35,935 bp. The MRKpAdHVE3 is 32, 905 bp in length. The 
enhanced adenovector MRK Ad5 HTV-l gag is 35,453bp (See Figure 14 for vector 
map; see also Figure 15A-X show the complete pre-adenoviral vector sequence, 
which includes an additional 2,021 bp of the vector backbone). 
5 Table 5C shows the amplification rates of the new E3- gag containing virus 

MRK Ad5 EIV-1 gag E3-. Once again, this virus shows lower growth rate than the 
enhanced adenoviral vector. This roay be attributed to the decreased sized of this 
virus (due to the E3 gene deletion) compared with wild type Ad5. The MRK Ad5 
HIV-1 gag E3- virus is 32,810 bp in length. This can be compared with the wild type 
10 Ad5 which is 35,935 bp and MRK Ad5 HIV-1 gag which is 35,453 bp in length. 



49 



wo 02/22080 



PCT/USOl/28861 



Table 5A: Amplification ratios determined by AEX and QPA for WIRKAdSgag over several 
continuous passaging In serum free media. Following P6, two replicate samples were taken (rep- 
1 and rep-2) and analyzed. 



MRKAdSgag repl 





Xv{1lf eeOs/ir 
bitasUon 


iO,VlabBity{%) 
Harvesl 


HarvaslTlnia 


CeflPassags 
Nmnbsr 


Utar 
10*'vp/miculliire 


Tlar 


QPA 

1(fTaD«AnI 


Rslio 

A£X:QPA 


AmpUicaOon 
RaSo 


AEX 
Inttmst Control 


P4 


1^,B1% 


058^50% 


44 


46 


a.7 


5.9 


1.72 


50 


470 
fMOI = 125) 




P5 


1^93% 


0.68.47% 


48 


49 


6.7 


4.9 


1.38 


49 


170 


pe 


1.04, 94X 


0.68. 773^ 


47 


48 


&8 


S.6 


Tm. 


41 




P7 


1^,84% 


a86,61% 


49.S 


SO 


3.9 


14 


097 


40 


SO 


P7 


t.09. 97% 


a76,59% 


SO 


52 


52 


4.7 


1.70 


31 


170 


P8 


1.03.94% 


085.64% 


47.5 


54 


9.0 


a? 


1.10 


82 


310 


P9 


069.85% 


ass, 73% 


47.5 


56 


AA 


4J 


1X>3 


43 




ai2 

2.84 


P10 


1.09,91% 


1j06b6e% 


47.5 


58 


3.0 


2.8 


1.16 


28 


100 


2.70 
2.60 


Pit 


1.19. 88% 


098.65% 


47 


60 


3.6 


ao 


1.15 


31 


110 


Z70 
2.70 


P12 


aga, 9i% 


0.85,63% 


473 


47 


5w4 


5J 


1.20 


45 


200 


2.86 
2.60 


P13 


1.00.86% 


0.70,67% 


49 


49 


&8 


5.8 


1.11 


62 


210" 


3.1S 
3.18 


P14 


-1.94,92% 


a8a.€7% 


46 


63 


6£ 


4.4 






160 


3.28 
327 


PI 5 


0.97,96% 


064.66% 


47 


47 


6.9 


7.1 






250 


3.12 
2.91 



Table 5B: Amplification ratios detemnined by AEX and QPA 
continuous passaging in serum free media, MRKHVE3 is the 
NOT cany a transgene. 

MRKHVE3 



for MRKHVE3 over several 
new vector backbone which does 
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Table SC. Amplification ratios determined by AEX and QPA for MRKAd5gag(E3-) over several 
continuous passaging in serum free media. This construct Is identical to the MRKAdSgag construct 
except that this version is DELETED of the E3 gene. 

5 

MRKAd5gag(E3') 





Xv(i0^oeQsA; 
InteGllan 


iQ. Viability (%) 
Haivest 


HwBStTbno 

h.p.r. 


Cefl Passage 
Numbar 


Tllar 
10"vp/adcultura 


THar 
10*ypfeeD 


QPA 


Ralto 
AEX.-QPA 


AmpiScaiton 

Ratio 


AEX 
tntftmal Control 


FM 


1.62.77% 


1.12,62% 


47.S 


46 


2.0 


1.2 


092 


20 


100 
IMOWISS) 




PS 


1.16. K% 


062,43% 


49 


49 


3.3 


2.9 


099 


34 


100 


P6 


1.71,86% 


02010% 


49 


90 


4.7 


2,7 


1.70 


28 


100 


P6 


1.09. 97% 


063, 54% 


49S 


52 


5.4 


5.0 


1.76 


31 


180 


P7 


1.17.91% 


0^8, 72% 


47.50 


54 


7,1 


6.1 


0,67 


108 


220 


pa 


0l9B, BB% 


077.46% 


48 


66 


3.1 


3.2 


0.66 


47 


116 


3.12 
2J4 


P9 


1J20,B9% 


1.03.72% 


46 


56 


1.6 


\S 


057 


92 


S5 


2.70 
2.60 




099.62% 


08062% 


405 


60 


ZZ 


Z2 


0.68 


47 


116 


2.70 
2.70 


P11 


1.07. 96% 


098,70% 


48^ 


47 


53 


5.5 


0.68 


87 


200 


2^ 
2.60 


P12 


0.80. 91% 


0£7,59% 


SO 


49 


5.1 


&4 


072 


71 


230 


3.18 
3.16 


P13 


1.96.95% 


091.69% 


4&5 


53 


7.4 


3^ 






135 


3.28 
3.27 


P14 


097.96% 


061,74% 


48 


47 


6.6 








250 


3.12 
£91 


P15 


067,69% 


064,66% 


49 


49 


4.8 


5£ 






198 


276 
232 



EXAMPLE 14 
Gag Expression Analysis of the Novel Constructs 
In vitro gag analysis of the MRK Ad5 HTV-l gag and the original HIV-gag 
10 vectors (research and clinical lot) show comparable gag expression. The clinical lot 
shows only a slightly reduced gag expression level. The most noticeable difference is 
with the mCMV vector. This vector shows roughly 3 fold lower expression levels 
compared with the other vectors tested (which all contain hCMV promoters). The 
mCMV-FLgag with bGHpA assay was peifonned three times using different 
IS propagation and purification lots and it consistently exhibited weaker gag expression. 

EXAMPLE 15 

Evaluation of MRK Ad5 HTV-l gag and Other gag-Containing 
. . Adenovectors in Balb/c Mice 
20 Cohorts of 10 balb/c mice were vaccinated intramuscularly with escalating 

doses of MRK Ad5 HTV-l gag, and the research and clinical lots of original 
AdSHTV-lgag. Serum samples were collected 3 weeks post dose 1 and analyzed by 
anti-p24 sandwich BUS A, 
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Anti-p24 titers in mice that received MRK Ad5 HIV-1 gag (107 and 109 
vp(viral particle) doses) were comparable (Figure 13) to those of the research lot of 
Ad5HIV-l gag, for which much of the early rhesus data were generated on. Ttiese 
titers were also comparable when E3 is deleted (MRKAd5hCMVgagbGHpA(E3-)) or 

5 SPA is substituted for bGHpA terminator (MRKAdS hCMV-gag-SPA (E3+)) or 
murine CMV promoter is used in place of hCMV (MRKAdS mCMV-gag-bGHpA 
(E3+)) in the MRKAdS backbone. 

The results shown in Table 7 indicate that the three other vectors ( in addition 
to the preferred vector, MRK AdS HIV-1 gag, ate also capable of inducing strong 

10 anti-gag antibody responses in mice. Interestingly enough, while the mCMV-ELgag 
construct containing bOHpA and E3+ in an El parallel orientation showed lowest gag 
expression in the COS cell in vitro infection (Table 6) in comparison with the other 
vectors tested, it generated the greatest anti-gag antibody response this in vivo Balb/c 
study. Table 7 also shows a dose response in anti-gag antibody production in both the 

IS research and the clinical lot. As expected, the clinical lot shows reduced anti-gag 
antibody induction at each dosage level compared to the same dosage used for the 
research lot. — 

Table 6: In vitro analysis for gag expression in COS cells by Elisa assay. 



Viral Vectors* 


US gag/4.8xl0e5 COS/10e8 parts/48hr 


MRKAdfgag" ■ 


1.40 


Clinical lot AdSgag" 


1.28 


Research lot AdSgag* 


1.32 


MCMVFL-gagbGHpA" 


0.42 



* A260iiin absorbance readings taken for viral particle determinations. 

* MRKAdSgag was produced in serum free conditions and purified at PS. 
CUnical lot# AdSgagFNOOOl 

25 ^ Research AdSFLgag lot# 6399 

* mCMVEL-gagbGHpA was produced in serum free conditions and purified at PS. 
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Table 7: mHIV020 Anti-p24 Ab Titers in Balb/c mice (n=10) vaccinated with various Adgag 
constnicts and lots (3 week post dosel). 



Group 
ID 


Vaccine 


Dose 
(vp) 


GMT 


SE upper 


SE lower 


1 
2 


"iMRKAdSgag 
« 


10^7 
10^ 


25600 
409600 


5877 
94028 


4780 
76473 


3 
4 


hCMV FL-gag bGHpA [E3-] 
« 


10^7 
10A9 


7352 
235253 


2077 
59767 


1620 
47659 


5 
6 


bCm FL-gag SPA [E3+] -4 
* 


10^7 
10^ 


12800 
310419 


9905 
99181 


236 
75165 


7 
8 


'^mCM V FL-gag bGHpA [E3+] 
■ 


10^7 
10^ 


44572 
941014 


23504 
239068 


15389 
190636 


9 

10 


^hCMV FL-gag bGHpA [E3-] 
« 


10^7 
10^9 


3676 
117627 


934 
17491 


745 
15227 


11 
12 
13 
14 


research lot hClUIV intronA FL-gag bGHpA [E3-] <- 

■ 

m 
m 


10^ 
10^7 
10^ 
10^ 


528 
14703 
58813 
204800 


262 
5274 
14942 
S3232 


175 
3882 
11915 
42250 


15 
16 
17 
18 


Clinical lot hCMVIntronA FL-gag bGHpA [E3-] <- 

■ 

a 
a 


10^6 
10^7 
10^ 
10A9 


230 
4222 
19401 
89144 


82 
3405 
3939 
25187 


61 

1138 
3274 
19639 


19 


Narve 


none 


93 


7 


6 



' *2x50 i.m. (quad) injections/animal 
P.I.5: Youil, Chen, Casimiro 
VacdnaUon: T. Toner, Q. Su 
Assay: M. Chen 



"The stnjcture of MRKAd5gag is: hCMVFL-gagbGHpA [E3+] -> The same lot of MRKAd5gag 
used in this rodent study was used in the Rhesus monlcey study (Tables 7 and 8). 
The same lot of mCMVFL-gagbGHpA[E3-i-] used in the in vitro study (Table 6) ws used here. 
Hliis construct was designed by Vollcer Sandlg. It contains a shorter version of the hCMV 
promoter than that used In the MRK constructs. The adenovector backbone is identical to the 
♦ , original bacl<bone used In the original Adgag vector. Expression at 1 0e7 dose from this vector is 7 
fold lower then the same dose of the MRKAd5gag and 4 fold lower than the research lot. 



EXAMPLE 16 

Comparison of Humoral and Cellular Responses Towards the Original Ad-gag 
Construct with the New MRK Ad5 HIV-1 gag in Rhesus Monkeys 
5 Cohorts of 3 rhesus monkeys were vaccinated intramuscularly with MEIK Ad5 

HIV-1 gag or the clinical AdSgag bulk at two doses, 10^^ vp and 10^ vp. 
Immunizations were conducted at week 0, 4, and 25. Serum and PBMC samples were 
collected at selected time points. The serum sample were assayed for anti-p24 Ab 
titers (using competitive based assay) and the PBMCs for antigen-specific HrN- 
10 gamma secretion following overnight stimulation with gag 20-mer peptide pool (via 
ELiSpot assay). 

Hie results shovm in Table 8 indicate comparable responses with respect to 
the generation of anti-gag antibodies. The frequencies of gag-specific T cells in 
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periphM:al blood assummarized in Table 9 demonstrate a strong cellular immune 
response generated after a single dose with the new construct MRK Ad5 HTV-l gag. 
The responses are also boostable with second dose of the same vector. The vector is 
also able to induce CD8+ T cell responses (as evident by lemaining spot counts after 
CD4+ depletion of PBMCs) which are responsible for cytotoxic activity. 

Table 8 Anti-p24 antibody titers Cm mMU/mL) in rhesus macaques immunized with 
gag-expcessing adenovectors (Protocol Hiy203). 
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Table 9. Number of gag-specific T cells per miUion peripheral blood mononuclear 
cells (PBMCs) in rhesus monkeys immunized with gag-expressing adenovectors* 
Also included are those jBrequencies in PBMCs depleted of CD4r^ T cells. 



Grp8 



Vbodndlon 



Monte/ID 



MZKAdStPD 



97N010 
97N010CCD4-) 

97Nn6 
97N116(C04.) 

98X007 
98Xa07CCD4-) 



6 
4 

1 

n 

10 

20 



SPflH" 



69 

3e 

386 
676 
579 
965 



T=AWK 



0 

1 

0 



395 
609 
1304 



Meda 



0 

3 
0 
0 
3 
0 



GOOH 



1058 

993 

534 

593 

2193 

2675 



Tg16Wk 



Media 



0 
4 
1 



GooH 



1174 
395 
2118 



3 
0 
1 
0 
3 
0 



SoaH 



775 
76 
261 
164 
1588 
1666 



Meda 



4 
0 
D 
0 
0 
0 



GoaH 

1074 
594 
408 
666 
2113 
1278 



10^ vp 



97N120 
97N120(CD4-) 

97N144 
97N144eCI74-) 

98X008 

98XOoe(q>4-> 



5 

n 

3 
6 
4 
14 



S75 
170 
236 
148 
368 
696 



1 

6 
1 



249 
438 
1090 



4 
0 
1 

0 
3 
0 



141 

85 
316 
285 
891 
1U5 



4 
3 
4 



119 
256 
673 



9 
0 
1 

ND 
3 
0 



206 

75 

98 

^D 

473 
391 



4 
1 

5 
0 
5 
4 



219 
219 
373 
625 
735 
848 



Ac&Qagcflniaci iof 



97X001 
97X001CaM.> 

97N146 
97N146<!C04-) 

98X009 
98X009CCD4-) 



0 

10 

3 

6 

0 

0 



261 
283 
-150 
133 
93 
73 



1 
1 
3 



485 
465 
339 



0 
3 
0 
0 

3 
0 



817 
996 
339 
370 
559 
333 



0 
1 
0 



1220b 

1272 

896 



1 
0 
3 
0 
1 
0 



894 

1010 

1238 

654 

384 

225 



4 At£mti\n<xi\dt 

10^ vp 



97N020 
97N020CCO4-) 

97X003 
97X003(CD4-) 

98X012 
98X012CCD4-) 



3 

10 

4 

9 

5 

11 



30 
29 
68 
40 
95 
70 



1 

5 
3 



101 
134 
54 



0 
0 
0 
0 
1 
0 



66 

15 

16 

6 

34 

11 



0 
1 

0 



36 
38 
18 



0 
0 
4 
0 
0 
0 



26 
1 

38 
4 

20 
6 



0 
0 
3 
0 
0 
0 



0 
0 
6 
0 
1 
0 



1658 

1123 

1785 

971 

1748 

644 



41 
16 
81 
19 
121 
41 

_ 
9 



96R041 
053F 



6 
14 



8 
18 



1 

5 



1 

16 



0 
20 



0 
14 



D 
19 



0 
IS 



0 
10 



0 
15 



\ 

2A 



Bcsedcneim 4x10^ a 2x10^ adis per veil (c^pendnocnspor d^tty) 
VD. not dBferrrinecl 
Vtxk a no oec*cte oontrd 
5 't>crfof3D<npetfda5 0><eftaxtnoby10oouOtaiLUipaBslTO 



The adenovectors described herein and, particularly, MRK Ad5 HIV-1 gag, 
represent very promising HTV-gag adenovectors with respect to their enhanced 
growth characteristics in both serum and, more importantly, in serum-free media 

10 conditions. In comparison with the current HTV-l gag adenovector construct, MRK 
Ad5 HIV-1 gag shows a 5-10 fold increased amplification rate. We have shown that 
it is genetically stable at passage 21, This construct is able to generate significant 
cellular immune responses in vivo even at a relatively low dose of 10^9 vp. The 
potency of the MRKAdSgag construct is comparable to, if not better than the original 

15 HTV-lgag vector as shown in this rhesus monkey study. 

EXAMPLE 17 

CODON OPTIMIZED HIV-1 POL AND CODON OPTIMZED 

mV-l POL MODIFICATIONS 
20 The open reading frames for the various synthetic pol genes disclosed herein 

comprise coding sequences for the reverse transcriptase (or RT which consists of a 
polymerase and RNase H activity) and integrase (IN). ITie protein sequence is based 
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on that of Hxb2r, a clonal isolate of mB; this sequence has been shown to be closest 
to the consensus clade B sequence with only 16 nonidentical lesidues out of 848 
(Korber, et al., 1998. Human retroviruses and AIDS, Los Alamos National 
Laboratory, Los Alamos, New Mexico). The skiUed artisan wiU understand after 
review of this specification that any available HIV-1 or HIV-2 strain provides a 
potential template for the generation of HIV pol DNA vaccme constructs disclosed . 
herein. It is further noted that the protease gene is excluded from the DNA vaccine 
constructs of the present invention to insure safety from any residual protease activity 
in spite of mutational inactivation. The design of the gene sequences for both wild- 
type (wt-pol) and inactivated pol (lA-pol) incorporates the use of human preferred 
("humanized") codons for each amino acid residue in the sequence m order to 
maximize in vivo mammalian expression (Lathe. 1985. J. Mol Biol. 183:1-12). As 
can be discerned by inspecting the codon usage in SEQ ID NOs: 1, 3, 5 and 7, the 
following codon usage for mammalian optimization is preferred: Met (ATG), Gly 
(GGC), Lys (AAG>, Trp (TOG). Ser CTCC). Arg (AGG). Val (GTG). Pro (CCC). Thr 
(ACQ. Glu (GAG); Leu (CTG). His (CAC). lie (ATC). Asn (AAC). Cys CTGC), Ala 
(GCC)i Ghi (CAG), Phe (TTG) and Tyr (TAG). For an additional discussion relating 
to manimalian (human) codon optimization, see WO 97/31115 (PCr/US97/02294). 
which, as noted elsewhere in this specification, is hereby incorporated by reference. It 
is intended that the sldUed artisan may use alternative versions of codon optimization 
or may omit this step when generating HIV pol vaccine constructs within the scope of 
the present invention. Therefore, the present invention also relates to non-codon 
optimized versions of DNA molecules and associated recombinant adenoviral HIV 
vaccines which encode the various wild type and modified forms of the HIV Pol 
protein disclosed herein. However, codon optimization of these constructs is a 
preferred embodiment of this invention. 

A particular embodiment of this portion of the invention comprisies codon 
optimized nucleotide sequences which encode wt-pol DNA constmcts (herein, "wt- 
pol" or "wt-pol (codon optimized))" wherein DNA sequences encoding the protease 
(PR) activity are deleted, leaving codon optimized "wild type" sequences which 
encode RT (reverse transcriptase and RNase H activity) and IN integrase activity. A 
DNA molecule which encodes this protein is disclosed herein as SEQ ID N0:1, the 
open reading frame being contained fiom an initiating Met residue at nucleotides 10- 
12 to a termination codon from nucleotides 2560-2562. SEQ ID NO: 1 is as foUows: 

AGATCTACCA TGGCCCCCAT CTCCCCCATT GAGACTGTGC CTGTGAAGCT GAAfiCCTGGC 
ATGGATGGCC CCAAGGTGAA GCAGTGGCCC CTGACTGAGG AGAAGATCAA GGCCCTGGTG 
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GAAATCTGCA CTGAGATGGA . GAAGGAGGGC 
TACAACACCC CTGTGTTTGC CATCAAGAAG 
GACTTCAGGG AGCTGAACAA GAGGACCCAG 
CACCCCGCTG GCCTGAAGAA 6AAGAAGTCT 
5 TTCTCTGTGC CCCTGGATGA GGACTTCAGG 
AACAATGAGA CCCCTGGCAT CAGGTACCAG 
TCCCCTGCCA TCTTCCAGTC CTCCATGACC 
CCTGACATTG TGATCTACCA GTACATGGAT 
GGGCAGCACA GGACCAAGAT TGAGGAGCTG 

10 ACCCCTGACA AGAAGCACCA GAAGGAGCCC 
CCCGACAAGT GGACTGTGCA GCCCATTGTG 
6ACATCCAGA AGCTGGTGGG CAAGCTGAAC 
6TGAGGCAGC TGTGCAAGCT GCTGAGGGGC 
ACTGAGGAGG CTGAGCTGGA GCTGGCTGAG 

15 GGGGTGTACT ATGACCCCTC CAAGGACCTG 
CAGTGGACCT ACCAAATCTA CCAGGAGCCC 
AGGATGAGGG GGGCCCACAC CAATGATGTG 
ACCACTGAGT CCATTGTGAT CTGGGGCAAG 
GAGACCTGGG AGACCTGGTG GACTGAGTAC 

20 TTTGTGAACA CCCCCCCCCT GGTGAAGCTG 
GGGGCTGAGA CCTTCTATGT GGATGGGGCT 
GGCTATGTGA CCAACAGGGG CAGGCAGAAG 
AAGACTGAGC TCCAGGCCAT CTACCTGGCC 
GTGACTGACT CCCAGTATGC CCTGGGCATC 

25 GAGCTGGTGA ACCAGATCAT TGAGGAGCTG 
GTGCCTGCCC ACAAGGGCAT TGGGGGCAAT 
ATCAGGAAGG TGCTGTTCCT GGATGGCATT 
CACTCCAACT GGAGGGCTAT GGCCTCTGAC 
ATTGTGGCCT CCTGTGACAA GTGCCAGCTG 

30 TGCTCCCCTG GCATCTGGCA GCTGGACTGC 
GCTGTGCATG TCGCCTGCGG CTACATTGAG 
GAGACTGCCT ACTTCCTGCT GAAGCTGGCT 
GACAATGGCT CCAACTTCAC TGGGGCCACA 
AAGCAGGAGT TTGGCATCCC CTACAACCCC 

35 AAGGAGCTGA AGAAGATCAT TGGGCAGGTG 
GTGCAGATGG CTGTGTTCAT CCACAACTTC 



AAAATCTCCA AGATTGGCCC CGAGAACCCC 
AAGGACTCCA CCAAGTGGAG GAAGCTGGTG 
GACTTCTGGG AGGTGCAGCT GGGCATCCCC 
GTGACTGTGC TGGATGTGGG GGATGCCTAC 
AAGTACACTG CCTTCACCAT CCCCTCCATC 
TACAATGTGC TGCCCCAGGG CTGGAAGGGC 
AAGATCCTGG AGCCCTTCA6 GAAGCAGAAC 
GACCTGTATG TGGGCTCTGA CCTGGAGATT 
AGGCAGCACC TGCTGAGGTG GGGCCTGACC 
CCCTTCCTGT GGATGGGCTA TGAGCTGCAC 
CTGCCTGAGA AGGACTCCTG GACTGTGAAT 
TGGGCCTCCC AAATCTACCC TGGCATCAAG 
ACCAAGGCCC TGACT6AGGT GATCCCCCTG 
AACAGGGAGA TCCTGAAG6A GCCTGTGCAT 
ATTGCTGAGA TCCAGAAGCA GGGCCAGGGC 
TTCAAGAACC TGAAGACTGG CAAGTATGCC 
AAGGAGCTGA CTGAGGCTGT GCA6AAGATC 
ACCCCCAAGT TCAAGCTGCC CATCCAGAAG 
TGGCAGGCCA CCTGGATCCC TGAGTGGGAG 
TGGTACCAGC TGGAGAAGGA GCCCATTGTG 
GCCAACAGG6 AGACCAAGCT GGGCAAGGCT 
GTGGTGACCC TGACTGACAC CACCAACCAG 
CTCCAGGACT CTGGCCTGGA GGTGAACATT 
ATCCAGGCCC AGCCTGATCA GTCTGAGTCT 
ATCAAGAAGG AGAAGGTGTA CCTGGCCTGG 
GAGCAGGTGG ACAAGCTGGT GTCTGCTGGC 
GACAAGGCCC AGGATGAGCA TGAGAAGTAC 
TTCAACCTGC CCCCTGTGGT GGCTAAGGAG 
AAGGGGGAGG CCATGCATGG GCAGGTGGAC 
ACCCACCTGG AGGGCAAGGT GATCCTGGTG 
GCTGAGGTGA TCCCT6CTGA GACAGGCCAG 
GGCAGGTGGC CTGTGAAGAC CATCCACACT 
GTGAGGGCTG CCTGCTGGTG GGCTGGCATC 
CAGTCCCAGG GGGTGGTGGA GTCCATGAAC 
AGGGACCAGG CTGAGCACCT GAAGACAGCT 
AAGAGGAAGG GGGGCATCGG GGGCTACTCC 
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GCTGGG6AGA GGATTGTGGA CATCATTGCC ACAGACATCC AGACCAAGGA GCTCCAGAA6 
CAGATCACCA AGATCCAGAA CTTCAGGGTG TACTACAGGG ACTCCAGGAA CCCCCTGTGG 
AAGGGCCCTG CCAAGCTGCT GTGGAAGGGG. 6AGGGGGCTG TGGTGATCCA GGACAACTCT 
GACATCAAGG TGGTGCCCA6 GAGGAAGGCC AAGATCATCA GG6ACTATGG CAAGCAGAT6 
5 GCTGGG6ATG ACTGTGTGGC CTCCAGGCA6 GATGAGGACT AAAGCCCGGG CAGATCT (SEQ 
ID N0:1) . 

The open reading frame of the wild type pol construct disclosed as SEQ ID 
NO: 1 contains 850 amino acids, disclosed herein as SEQ ID N0:2, as follows: 

Met Ala pro lie Ser Pro lie Glu Thr Val Pro Val Lys Leu Lys Pro 

10 Gly Met Asp Gly Pro Lys Val Lys Gin Trp Pro Leu Thr Glu Glu Lys 
lie Lys Ala Leu Val Glu lie Cys Thr Glu Met Glu Lys Glu Gly Lys 
lie Ser Lys lie Gly Pro Glu Asn Pro Tyr Asn Thr Pro Val Phe Ala 
He Lys Lys Lys Asp Ser Thr Lys Trp Arg Lys Leu Val Asp Phe Arg 
Glu Leu Asn Lys Arg Thr Gin Asp Phe Trp Glu Val Gin Leu Gly Ha 

15 Pro His Pro Ala Qly Leu Lys Lys Lys Lys Ser Val Thr Val Leu Asp 
Val Gly Asp Ala Tyr Phe Ser Val Pro Leu Asp Glu Asp Phe Arg Lys 
Tyr Thr Ala Phe Thr He Pro- Ser He Asn Asn-Glu Thr Pro Gly He 
Arg Tyr Gin Tyr Asn Val Leu Pro Gin Gly Trp Lys Gly Ser Pro Ala 
He Phe Gin Ser Ser Met Thr Lys He Leu Glu Pro Phe Arg Lys Gin 

20 Asn Pro Asp He Val He Tyr Gin Tyr Met Asp Asp Leu Tyr Val Gly 
Ser Asp Leu Glu He Gly Gin His Arg Thr Lys He Glu Glu Leu Arg 
Gin His Leu Leu Arg Trp Gly Leu Thr Thr Pro Asp Lys Lys His Gin 
Lys Glu Pro Pro Phe Leu Trp Met Gly Tyr Glu Leu His Pro Asp Lys 
Trp Thr Val Gin Pro He Val Leu Pro Glu Lys Asp Ser Trp Thr Val 

25 Asn Asp He Gin Lys Leu Val Gly Lys Leu Asn Trp Ala Ser Gin He 
Tyr Pro Gly He Lys Val Arg Gin Leu Cys Lys Leu Leu Arg Gly Thr 
Lys Ala Leu Thr Glu Val He Pro Leu Thr Glu Glu Ala Glu Leu Glu 
Leu Ala Glu Asn Arg Glu He Leu Lya Glu Pro Val His Gly Val Tyr 
Tyr Asp Pro Ser Lys Asp Leu He Ala Glu He Gin Lys Gin Gly Gin 

30 Gly Gin Trp Thr IVr Gin He Tyr Gin Glu' Pro Phe Lys Asn Leu Lys 
Thr Gly Lys Tyr Ala Arg Met Arg Gly Ala His Thr Asn Asp Val Lys 
Gin Leu Thr Glu Ala Val Gin Lys He Thr Thr Glu Ser He Val lie 
Trp Gly Lys Thr Pro Lys Phe Lys Leu Pro He Gin Lys Glu Thr Trp 
Glu Thr Trp Trp Thr Glu Tyr Trp Gin Ala Thr Trp He Pro Glu Trp 
35 Glu Phe val Asn Thr Pro Pro Leu Val Lys Leu Trp Tyr Gin Leu Glu 
Lys Glu Pro He Val Gly Ala Glu Thr Phe Tyr Val Asp Gly Ala Ala 
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Asn Arg Glu Thr Lys Leu Gly Lya Ala Gly Tyr Val Tbr Asn Arg Gly 
Arg Gin Lys Val Val Thr Leu Thr Asp Thr Thr Asn Gin Lys Thr Glu 
Leu Gin Ala lie Tyr Leu Ala Leu Gin Asp Ser Gly Leu Glu Val Asn 
lie Val Thr Asp Ser Gin Tyr Ala Leu Gly lie lie Gin Ala Gin Pro 
5 Asp Gin Ser Glu Ser Glu Leu Val Asn Gin lie He Glu Gin Leu He 
Lys Lys Glu Lys Val Tyr Leu Ala Trp Val Pro Ala His Lys Gly He 
Gly Gly Asn Glu Gin Val Asp Lys Leu Val Ser Ala Gly He Arg Lys 
Val Leu Phe Leu Asp Gly He Asp Lys Ala Gin Asp Glu His Glu Lys 
Tyr His Ser Asn Trp Arg Ala Met Ala Ser Asp Phe Asn Leu Pro Pro 

10 Val Val Ala Lys Glu He Val Ala Ser Cys Asp Lys Cys Gin Leu Lys 
Gly Glu Ala Met His Gly Gin Val Asp Cys Ser Pro Gly He Trp Gin 
Leu Asp Cys Thr His Leu Glu Gly Lys Val He Leu Val Ala Val His 
Val Ala Ser Gly Tyr He Glu Ala Glu Val He Pro Ala Glu Thr Gly 
Gin Glu Thr Ala Tyr Phe Leu Leu Lys Leu Ala Gly Arg Trp Pro Val 

15 Lys Thr He His Thr Asp Asn Gly Ser Asn Phe Thr Gly Ala Thr Val 
Arg Ala Ala Cys Trp Trp Ala Gly He Lys Gin Glu Phe Gly He Pro 
Tyr Asn Pro Gin Ser Gin Gly Val Val Glu Ser Met Asn Lys Glu Leu 
Lys Lys He He Gly Gin Val Arg Asp Gin Ala Glu His Leu Lys Thr 
Ala Val Gin Met Ala Val Phe He His Asn Phe Lys Arg Lys Gly Gly 

20 He Gly Gly Tyr Ser Ala Gly Glu Arg He Val Asp He He Ala Thr 
Asp He Gin Thr Lys Glu Leu Gin Lys Gin He Thr Lys He Gin Asn 
Phe Arg Val Tyr Tyr Arg Asp Ser Arg Asn Pro Leu Trp Lys Gly Pro 
Ala Lys Leu Leu Trp Lys Gly Glu Gly Ala Val Val He Gin Asp Asn 
Ser Asp He Lys Val Val Pro Arg Arg Lys Ala Lys He He Arg Asp 

25 Tyr Gly Lys Gin Met Ala Gly Asp Asp Cys Val Ala Ser Arg Gin Asp 
Glu Asp (SEQ ID N0:2) . 

The present invention especially relates to an adenoviral vector vaccine which 
comprises a codon optimized HIV-1 DNA pol construct wherein, in addition to 
deletion of the portion of the wild type sequence encoding the protease activity, a 

30 combination of active site residue mutations are introduced which are deleterious to 
HIV-1 pol (RT-RH-IN) activity of the expressed protein. Therefore, the present 
invention preferably relates to an adenoviral HIV-1 DNA pol-based vaccine wherein 
the construct is devoid of DNA sequences encoding any PR activity, as well as 
containing a mutation(s) which at least partially, and preferably substantially, 

35 abolishes RT, RNase and/or IN activity. One type of HIV-1 pol mutant which is part 
and parcel of an adenoviral vector vaccine may include but is not limited to a mutated 
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DNA molecule comprising at least one nucleotide substitution which results in a point 
mutation which effectively alters an active site within the RT, RNase and/or IN 
regions of the expressed protein, resulting in at least substantially decreased 
enzymatic activity for the RT, RNase H and/or IN functions of HTV-l Pol. In a 

5 preferred embodiment of this portion of the invention, a HIV-1 DNA pol construct 
contains a mutation or mutations within the Pol coding region which effectively 
aboUshes RT, RNase H and IN activity. An especially preferable HIV-1 DNA pol 
construct in a DNA molecule which contains at least one point mutation which alters 
the active site of the RT, RNase H and IN domains of Pol, such that each activity is at 

10 least substantially abolished. Such a HIV-1 Pol mutant will most likely comprise at 
least one point mutation in or around each catalytic domain responsible for RT, 
RNase H and IN activity, respectfully. To this end. an especially preferred HIV-1 
DNA pol construct is exemplified herein and contains nine codon substitution 
mutations which results in an inactivated Pol protein (lA Pol: SEQ ID N0:4, Figure 

15 17A-Q which has no PR, RT, RNase or IN activity, wherein three such point 

mutations reside within each of the RT, RNase and IN catalytic domains. Therefore, 
an especiaUy preferred exemplification is an adenoviral vaccine which comprises, in 
an appropriate fashion, a DNA molecule which racodes lA-pol, which contains all 
nine mutations as shown below in Table 1. An additional preferred amino add 

20 residue for substitution is Asp551, localized within the RNase domain of Pol. Any 
combination of the mutations disclosed herein may suitable and therefore may be 
utilized as an lA-Pol-based vaccine of the present invention. While addition and 
deletion mutations are contemplated and witiiin the scope of the invention, tiie 
preferred mutation is a point mutation resulting in a substitiition of the wild type 

25 amino acid with an alternative amino acid residue. 

Table! 





wt aa 


aa residue 


mutant aa 


enzvme function 




Asp 


112 


Ala 


RT 




Asp 


187 


Ala 


RT 


30 


Asp 


188 


Ala 


RT 




Asp 


445 


Ala 


RNase H 




Glu 


480 


Ala 


RNase H 




Asp 


500 


Ala 


RNase H 




Asp 


626 


Ala 


EST 


35 


Asp 


678 


Ala 


IN 




Glu 


714 


Ala 


IN 
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It is preferred that point mutations be incorporated into the lApol mutant adenoviral 
vaccines of the present invention so as to lessen the possibility of altering epitopes in 
and around the active site(s) of BQV-l Pol. 

To this end, SEQ ID N0:3 discloses the nucleotide sequence which codes for 
5 a codon optimized pol in addition to the nine mutations shown in Table 1, disclosed as 
follows, and referred to herein as "lApol": 







TGGCCCCCAT 


CTCCCCCATT 


GAGACTGTGC 


CTGTGAAGCT 


GAAGCCTGGC 






CCAAGGTGAA 


GCAGTGGCCC 


CTGACTGAGG 


AGAAGATCAA 


GGCCCTGGTG 






CTGAGATGGA 


6AAGGAGGGC 


AAAATCTCCA 


AGATTGGCCC 


C6AGAACCCC 


10 

1 


TACAACACCC 


CTGTGTTTGC 


CATCAAGAAG AAGGACTCCA 


CCAAGTGGAG 


GAAGCTGGT6 






AGCTGAACAA 


GAGGACCCAG 


GACTTCTGGG 


AGGTGCAGCT 


GGGCATCCCC 






GCCTGAAGAA GAAGAAGTCT GTGACTGTGC 


TGGCTGTGGG 


GGATGCCTAC 




X X^ X ^ X w X 


CCCTGGATGA 


GGACTTCAGG AAGTACACTG 


CCTTGACCAT 


CCCCTCCATC 




AAPAATGAGA 


CCCCTGGCAT 


CAGGTACCAG 


TACAATGTGC 


TGCCCCAGGG 


CTGGAAGGGC 




TPPPPTfiPPA 

X X\3\«\*A 


TCTTCCAGTC 


CTCCAT6ACC 


AAGATCCTGG 


AGCCCTTCAG 


GAAGCAGAAC 




PPTRAPATTG 

X UXl^fX X X w 


TGATCTACCA 


GTACATGGCT 


GCCCTGTATG 


TGGGCTCT6A 


CCTGGAGATT 




GGGP AfiPAPA 


GGACCAAGAT 


TGAGGAGCTG 


AGGCAGCACC 


TGCTGAGGTG 


GGGCCTGACC 




APPPPTGAPA 


A6AAGCACCA 


GAAGGAGCCC 


CCCTTCCTGT 


GGATGGGCTA 


TGAGCTGCAC 




CPCGACAAGT 


GGACTGTGCA GCCCATTGTG CTGCCTGAGA 


AGGACTCCTG 


GACTGTGAAT 


20 


GAP ATPPAG A 


AGCTGGTGGG 


CAAGCTGAAC 


TGGGCCTCCC 


AAATCTACCC 


TGGCATCAAG 






TGTGCAAGCT 


GCT6AGGGGC 


ACCAAGGCCC 


TGACTGA6GT 


GATCCCCCTG 




APTGAGGAGG 

A\.» X wAw wAww 


CTGAGCTGGA GCTGGCTGAG AACAGGGAGA 


TCCT6AAGGA 


GCCTGTGCAT 




GGGGTCTACT 


ATGACCCCTC 


CAAGGACCTG ATTGCTGAGA 


TCCAGAAGCA 


6GGCCAGGGC 




CAGTGGACCT 


ACCAAATCTA 


CCA6GAGCCC 


TTCAAGAACC 


TGAAGACTGG 


CAAGTATGCC 


25 


AGGAT6AGG6 


GGGCCCACAC 


CAATGATGTG AAGCAGCTGA 


CTGAGGCTGT 


GCAGAAGATC 




ACCACTGAGT 


CCATTGTGAT 


CTGGGGCAAG ACCCCCAAGT 


TCAAGCTGCC 


CATCCAGAAG 




6AGACCTGG6 


AGACCTGGTG 


GACTGAGTAC 


TGGCAGGCCA 


CCTGGATCCC 


TGAGTGGGAG 




TTTGTGAACA 


CCCCCCCCCT 


GGTGAAGCTG 


TGGTACCAGC 


TGGAGAAGGA 


GCCCATTGTG 




GG66CTGAGA 


CCTTCTATGT 


GGCTGGGGCT 


GCCAACAGG6 


AGACCAAGCT 


GGGCAAGGCT 


30 


GGCTATGTGA 


CCAACAGGGG 


CAGGCAGAAG 


GTGGTGACCC 


TGACTGACAC 


CACCAACCAG 




AAGACTGCCC 


TCCAGGCCAT CTACCTGGCC 


CTCCAGGACT 


CTGGCCTGGA 


GGTGAACATT 




GTGACTGCCT 


CCCAGTATGC 


CCTGGGCATC 


ATCCAGGCCC 


AGCCTGATCA 


GTCTGAGTCT 




GAGCTGGTGA 


ACCAGATCAT 


TGAGGAGCTG 


ATCAAGAAGG 


AGAAGGTGTA 


CCTGGCCTGG 




GTGCCTGCCC 


ACAAGGGCAT 


TGGGGGCAAT 


GAGCAG6TGG 


ACAAGCTGGT 


GTCTGCTGGC 


35 


ATCAGGAAGG 


TGCTGTTCCT 


GGATGGCATT 


GACAAGGCCC 


AGGATGAGCA 


TGAGAAGTAC 




CACTCCAACT 


GGAGGGCTAT 


GGCCTCTGAC 


TTCAACCTGC 


CCCCTGTGGT 


GGCTAAGGAG 
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ATTGTGGCCT CCT6TGACAA GTGCCAGCTG AAGGGGGAGG CCATGCATGG GCAGGTGGAC 
TGCTCCCCTG GCATCTGGCA GCTC6CCTGC ACCCACCTGG AGGGCAAGGT GATCCTGGTG 
GCTGTGCATG TGGCCTCCGG CTACATTGAG GCTGAGGTGA TCCCTGCT6A GACAGGCCAfi 
6AGACTGCCT ACTTCCTGCT GAAGCTGGCT GGCAGGT6GC CTGTGSAAGAC CATCC»CACT 
GCCAATGGCT CCAACTTCAC TGGGGCCACA GTGAGGGCTG CCTGCTGGTG GGCTGGCATC 
AAGCAGGAGT TTGGCATCCC CTACAACCCC CAGTCCCAGG GGGTGGTGGC CTCCA1GAAC 
AAGGAGCTGA AGAA6ATCAT TGGGCAGGTS AGGGACCAGG CTGAGCACCT GAAGACAGCT 
6TGCAGATGG CTGTGTTCAT CCACAACTTC AAGAGGAAG6 GGGGCATCGG GGGCTACTCC 
GCTGGGGAGA GGATTGTG6A CATCATTGCC ACAGACATCC AGACCAAGGA GCTCCAGAAG 
CAGATCACCA AGATCCAGAA CTTCAGGGTG TACTACAGGG ACTCCAGGAA CCCCCTGTGG 
AAGGGCCCTG CCAAGCTGCT GTGGAAGGGG GAGGGGGCTG TGGTGATCCA GGACAACTCT 
GACATCAAGG TCGTGCCCAG GAGGAAGGCC AAGATCATCA GGGACTATG6 CAAGCAGATG 
6CTGGGGATG ACTGTGTGGC CTCCAGGCAG GATGAGGACT AAAGCCCGGG CAGATCT (SBQ ID 
N0:3) . 

In order to produce the lA-pol-based adraioviral vaccines of the present 
invention, inactivation of the enzymatic functions was achieved by replacing a total of 
nine active site residues firom-the enzyme subunits with alanine side-chains. As- - 
shown in Table 1, aU residues that comprise the catalytic triad of the polymerase, 
namely Aspll2, Aspl87, and Aspl88, were substimted with alanine (Ala) residues 
(Larder, et al.. Nature 1987, 327: 716-717; Larder, et al.. 1989, Proc. Natl Acad. ScL 
1989, 86: 4803-4807). Three additional mutations were introduced at Asp445, 
Glu480 and AspSOO to aboUsh RNase H activity (Asp551 was left unchanged in this 
lA Pol construct), with each residue being substituted for an Ala residue, respectively 
(Davies, et al., 1991, Science 252:. 88-95; Schatz, et al., 1989, FEBS Lett. 257: 311- 
314; Mizrahi. et al., 1990, Nucl. Acids. Res. 18: pp. 5359-5353). HIV pol integrase 
function was abolished through three mutations at Asp626, Asp678 and Glu714. 
Again, each of these residues has been substituted with an Ala residue (Wiskerchen, 
et al., 1995, J. Virol. 69: 376-386; Leavitt, et al., 1993. J. Biol. Chem. 268: 2113- 
2119). Amino acid residue Pro3 of SEQ ID N0:4 marks the start of the RT gene. 
The complete amino acid sequence of lA-Pol is disclosed herein as SEQ ID N0:4 and 
Figure 17A-C, as follows: 

Ket Ala Pro lie Ser Pro He Glu Thr Val Pro Val Lys Leu Lys Pro 
Gly Met Asp Gly Pro Lys Val Lys Gin Trp Pro Leu -Chr Glu Glu Lys 
He Lya Ala Leu Val Glu He Cys Thr Glu Met Glu Lys Glu Gly Lys 
He Ser Lys He Gly Pro Glu Asn Pro Tyr Asn Thr Pro Val Phe Ala 
He Lys Lys Lys Asp Ser Thr Lys Trp Arg Lys Leu Val Asp Phe Arg 
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121 11 


T>oii 


Asn 


Lys 


Aver 






AlS 


Pro 


Ala 


\3xy 




vaX 


Civ 


Asp Ala 


lyx 




lyr 


lllx 


Ala 


Phe 


X IXX 








Gin Tyr 






XJL6 


P>io 


Gin 


Ser 


OCX 




K 


IrxO 


Asp 


lie 


VAX 




OCX 


TV em 
riSp 


Leu 


Glu 


Tie 
X xc# 








Leu 


Leu 




10 




wX u 


Pro 


Pro 


Phe 






A XXi. 


Val 


Gin 


Pro 




TV cfM 


TV eT*^ 
ASp 


lie 


Gin 


uys 






JtrX \j 


Gly He 


Lvs 




Julys 




Leu 


Thr 


C5l u 

V7X U 


1< 


jjeu 


Axa 


Glu 


Asn 


Axg 






TV 


Pro 


Ser 






o±y 


oxh 


Trp 


Thr 


lyx 




lux 


to±y 


Lys 


Tyr 








j-ieu 


Thr 


Glu 


aXA 




ITp 


Cl w 


Lys 


Thr 


f X Vi^ 




fill 11 




Trp Trp 


X LIX 






XrXlC? 


Val 


Asn 






T.V/'C! 


wX U 


Pro 


lie 


Val 

V Ci 1 




TV en 


Ax9 


Glu 


Thr 


J_ijr 9 




AX^ 




Lys 


Val 


Val 

V Cli X 




iJ6U 


d Tl 

i9xn 


Ala 


He 


lyx 








Thr 


Ala 


OCX 




TV 


UXll 


Ser 


Glu 


OCX 




Lys 


I»ys 


Glu Lys 


Val 


30 


Gly 


Gly 


Asn 


Glu 


Gin 




Val 


Leu 


Phe 


Leu 


Asp 




Tyr 


His 


Ser 


Asn 


Trp 




Val 


Val 


Ala Lys 


Glu 




Gly 


Glu 


Ala 


Met 


His 


35 


lieu 


Ala 


Cys 


Thr 


His 




Val 


Ala 


Ser Gly 


Tyr 



Thr Gin Asp Phe Trp Glu 
Leu Lys Lys Lys Lys Ser 
Phe Ser Val Pro Leu Asp 
He Pro Ser lie Asn Asn 
Val Leu Pro Gin Gly Trp 
Met Thr Lys He Leu Glu 
He Tyr Gin Tyr Met Ala 
Gly Gin His Arg Thr Lys 
Trp Gly Leu Thr Thr Pro 
Leu Trp Met Gly Tyr Glu 
He Val Leu Pro Glu Lys 
Leu Val Gly Lys Leu Asn 
Val Arg Gin Leu Cys Lys 
Val He Pro Leu Thr Glu 
Glu He Leu Lys Glu Pro 
Asp Leu He Ala Glu He 
Gin He Tyr Gin Glu Pro 
Arg Met Arg Gly Ala His 
Val Gin Lys He Thr Thr 
Lys Phe Lys Leu Pro He 
Glu Tyr Trp Gin Ala Thr 
Pro Pro Leu Val Lys Leu 
Gly Ala Glu Thr Phe Tyr 
Leu Gly Lys Ala Gly Tyr 
Thr Leu Thr Asp Thr Thr 
Leu Ala Leu Gin Asp Ser 
Gin Tyr Ala Leu Gly He 
Glu Leu Val Asn Gin He 
Tyr Leu Ala Trp Val Pro 
Val Asp Lys Leu Val Ser 
Gly He Asp Lys Ala Gin 
Arg Ala Met Ala Ser Asp 
He Val Ala Ser Cys Asp 
Gly Gin Val Asp Cys Ser 
Leu Glu Gly Lys Val He 
lie Glu Ala Glu Val He 
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Val 


Gin Leu Gly 


He 


Val 


Thr Val Leu 


Ala 


Glu Asp Phe Arg 


Lys 


Glu Thr Pro Gly 


He 


Lys 


Gly Ser Pro 


Ala 


Pro 


Phe Arg Lys 


Gin 


Ala Leu Tyr Val 


Gly 


He 


Glu Glu Leu 


Arg 


Asp 


Lys Lys His 


Gin 


Leu 


His Pro Asp 


Lys 


Asp 


Ser Trp Thr 


Val 


Trp Ala Ser Gin 


He 


Leu Leu Arg Gly 


Thr 


Glu 


Ala Glu Leu 


Glu 


Val 


His Gly Val 


Tyr 


Gin 


Lys Gin Gly 


Gin 


Phe 


Lys Asn Leu 


Lys 


Thr Asn Asp Val 


Lys 


Glu 


Ser He Val 


He 


Gin 


Lys Glu Thr 


Trp 


Trp 


He Pro Glu 


Trp 


Trp 


Tyr Gin Leu 


Glu 


Val 


Ala Gly Ala 


Ala 


Val 


Thr Asn Arg 


Gly 


Asn 


Gin Lys Thr 


Ala 


Gly 


Leu Glu Val 


Asn 


He 


Gin Ala Gin 


Pro 


He 


Glu Gin Leu 


He 


Ala 


His Lys Gly 


lie 


Ala 


Gly He Arg 


Lys 


Asp 


Glu His Glu 


Lys 


Phe 


Asn Leu Pro 


Pro 


Lys 


Cys Gin Leu 


Lys 


Pro 


Gly He Trp 


Gin 


Leu 


Val Ala Val 


His 


Pro 


Ala Glu Thr 


Gly 
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Gin Glu Thr Ala Tyr Phe Leu Leu Lys Leu Ala Gly Arg Trp Pro Val 
Lys Thr He His Thr Ala Asn Gly Ser Asn Phe Thr Gly Ala Thr Val 
Arg Ala Ala Cys Trp Trp Ala Gly He Lys Gin Glu Phe Gly He Pro 
Tyr Asn Pro Gin Ser Gin Gly Val Val Ala Ser Met Asn Lys Glu Leu 
5 Lys Lys He He Gly Gin Val Arg Asp Gin Ala Glu His Leu Lys Thr 
Ala Val Gin Met Ala Val Phe He His Asn Phe Lys Arg Lys Gly Gly 
He Gly Gly Tyr Ser Ala Gly Glu Arg He Val Asp He He Ala Thr 
Asp He Gin Thr Lys Glu Leu Gin Lys Gin He Thr. Lys He Gin Asn 
Phe Arg Val Tyr Tyr Arg Asp Ser Arg Asn Pro Leu Trp Lys Gly Pro 
10 Ala Lys Leu Leu Trp Lys Gly Glu Gly Ala Val Val He Gin Asp Asn 
Ser Asp He Lys Val Val Pro Arg Arg Lys Ala Lys He He Arg Asp 
Tyr Gly Lys Gin Met Ala Gly Asp Asp Cys Val Ala Ser Arg Gin Asp 
Glu Asp (SEQ ID N0:4) . 

As noted above, it will be undMStood that any combination of the mutations 

1 5 disclosed above may be suitable and therefore be utilized as an lA-pol-based 

adenoviral HIV vaccine of the present invention, either when administered alone or in 

- - a combined modality regime and/or a prime-boost regimen. For example, it may be - 
possible to mutate only 2 of the 3 residues within the respective reverse transcriptase, 
RNase-H, and integrase coding regions while stiU abolishing these enzymatic 

20 activities. However, the lA-pol construct described above and disclosed as SEQ ID 
N0:3, as well as the expressed protein (SEQ ID NO:4;) is preferred. It is also 
preferred that at least one mutation be present in each of the three catalytic domains. 

Another aspect of this portion of the mvention are codon optimized HIV-1 
Pol-based vaccine constructions which comprise a eukaryotic trafficking signal 

25 peptide such as frran tPA (tissue-type plasminogen activator) or by a leader peptide 
such as is found in highly expressed mamoDoalian proteins such as immunoglobulin 
leader peptides. Any functional leader peptide may be tested for efficacy. Howevrar, 
a preferred embodiment of the present invention, as witii HIV-1 Nef consti^cts shown 
herein, is to provide for a HTV-I Pol mutant adenoviral vaccine construction wherein 

30 tiie pol coding region or a portion hereof is operatively Unked to a leader peptide, 
preferably a leader peptide firom human tPA. In other words, a codon optimized 
mV-l Pol mutant such as lA-Pol (SEQ ID N0:4) may also comprise a leader peptide 
at the amino termmal portion of the protein, which may effect ceUular trafficking and 
hence, immunogenicity of the expressed protein wifliin the host ceU. As noted in 

35 Figure 16A-B, a DNA vector which may be utilized to practice the present tavention 
may be modified by known recombmant DNA metiiodology to contain a leader signal 
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peptide of interest, such that downstream cloning of the modified HIV-1 protein of 
interest results in a nucleotide sequence which encodes a modified HIV-1 tP A/Pol 
protein. In the alternative, as noted above, insertion of a nucleotide sequence which 
encodes a leader peptide may be inserted into a DNA vector housing the open reading 
5 frame for the Pol protein of interest. Regardless of the cloning strategy, the end result 
is a polynucleotide vaccine which comprises vector components for effective gene 
expression in conjunction with nucleotide sequences which encode a modified HIV-1 
Pol protein of interest, including but not limited to a HTV-l Pol protein which 
contains a leader peptide. Ilie amino acid sequence of the human tPA leader utilized 

10 herein is as f oUows: MDAMKRGLCCVLLLCGAVFVSPSEISS (SEQ ID NO: 17). 
Therefore, another aspect of the present invention is to generate HIV-1 Pol-based 
vaccine constructions which comprise a eukaryotic trafficking signal peptide such as 
from tPA. To this end, the present invention relates to a DNA molecule which 
encodes a codon optimized wt-pol DNA construct wherein the protease (PR) activity 

IS is deleted and a human tPA leader sequence is fused to the 5' end of the coding region. 
A DNA molecule which encodes this protein is disclosed herem as SEQ ID N0:5, tiie 
open reading firame disclosed herein as SEQ ID N0:6. 

To this end, the present invention relates to a DNA molecule which encodes a 
codon optimized wt-pol DNA construct wherein the protease (PR) activity is deleted 

20 and a human tPA leader sequence is fused to tiie 5* end of the coding region ( herein, 
"tPA-wt-pol"). A DNA molecule which encodes this protein is disclosed herein as 
SEQ ID NO:5, the open reading frame being contained from an initiating Met residue 
at nucleotides 8-10 to a termination codon from nucleotides 2633-2635. SEQ ID 
N0:5 is as follows: 



25 


GATCACCATG 


GATGCAATGA 


AGAGA6GGCT 


CTGCTGTGTG 


CTGCTGCTGT 


GTGGAGCAGT 




CTTCGTTTCG 


CCCAGCGAGA 


TCTCCGCCCC 


CATCTCCCCC 


ATTGAGACTG 


TGCCTGTGAA 




GCTGAAiGCCT 


GGCATGGATG 


GCCCCAAGGX 


GAAGCAGTGG 


CCCCTGACTG 


AGGAGAAGAT 




CAAGGCCCTG 


GTGGAAATCT 


GCACTGAGAT 


GGA6AAGGAG 


GGCAAAATCT 


CCAAGATTGG 




CCCCG^GAAC 


CCCTACAACA 


CCCCTGTGTT 


TGCCATCAAG 


AAGAAGGACT 


CCACCAAGTG 


30 


GAGGAAGCTG 


GTGGACTTCA 


GGGAGCTGAA 


CAAGAGGACC 


CAGGACTTCT 


GGGAGGTGCA 




GCTGGGCATC 


CCCCACCCCG 


CTGGCCTGAA 


GAAiSAAGAAG 


TCTGTGACTG 


TGCTGGATGT 




6GGGGAT6CC 


TACTTCTCTG 


TGCCCCTGGA 


TGAGGACTTC 


AGGAAGTACA 


CTGCCTTCAC 




CATCCCCTCC 


ATCAACAATG 


AGACCCCTGG 


CATCAGGTAC 


CAGTACAATG 


TGCTGCCCCA 




GGGCTGGAAG 


GGCTCCCCTG 


CCATCTTCCA 


GTCCTCCATG 


ACCAAGATCC 


TGGAGCCCTT 


35 


CAGGAAGCA6 


AACCCTGACA 


TTGTGATCTA 


CCAGTACATG 


GATGACCTGT 


ATGTGGGCTC 




TGACCTGGAG 


ATTGGGCAGC 


ACA6GACCAA 


GATTGAGGAG 


CTGAGGCAGC 


ACCTGCTGAG 
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GT6GGGCCTG ACCACCCCTG ACAAGAAfiCA CCAGAAGGAG CCCCCCTTCC TGTQGATGGG 
CTATGAGCTG CACCCCGACA AGTCGACT6T GCAGCCCATT GTGCTGCCTG AGAAGGACTC 
CTGGACTGTG AAT6ACATCC AGAAGCTG6T GGGCAAGCTG AACTGGGCCT CCCAAATCTA 
CCCTGGCA1C AAGGTGAGGC AGCTGTGCAA GCTGCTGAGG GGCACCAAGG CCCTGACTGA 
GGTGATCCCC CTSACTGAGG AGQCTGAGCT GGAGCTGGCT 6AGAACA<3GG AGATCCTGAA 
GGftGCCTGTG CATGGGGTGT ACTATGACCC CTCCAAGGAC CTGATTGCTG ASATCCAGAA 
GCAGGGCCAG GGCCAGTGGA CCTACCAAAT CTACCAGGAG CCCTTCAAGA ACCTGAAGAC 
TGGCAAGTAT GCCAGGATGA GGGGGGCCCA CACCAATGAT GTGAAGCAGC TGACTGAGGC 
TGTGCAGAAG ATCACCACTG AGTCCATTGT GATCTGGGGC AAGACCCCCA AGTTCAAGCT 
GCCCATCCAG AAGGAGACCT GGGAGACCTG GTGGACTGAG TACTGGCAG6 CCACCTGGAT 
CCCTGAGTGG GAGTTTGTGA ACACCCCCCC CCTGGTGAAG CTGTGGTACC AGCTGGAGAA 
GGAGCCCATT GTGGGGGCTG AGACCTTCTA TGTGGATGGG GCTGCCAACA 666A6ACCAA 
GCTGGGCAAG GCTGGCTATG TGftCCAACAG GGGCAGGCAG AAGGTGGTGA CCCTGACTGA 
CACCACCAAC CAGAAGACTG AGCTCCAGGC CATCTACCK3 GCCCTCCAGG ACTCTGGCCT 
GGAGGTGAAC ATTGTGACTG ACTCCCAGTA TGCCCTGGGC ATCATCCAGG CCCAGCCTGA 
TCAGTCTGAG TCT6AGCTG6 TGAACCAGAT CATTGAGCAG CTGATCAAGA AGGAGAAGGT 
GTACCTGGCC TGGGTGCCTG CCCACAAGG6 CATTGGGGGC-AATGAGCAGG TG6ACAAGCT - 
GGTGTCTGCT GGCATCAGGA AGGTGCTGTT CCTGGATGGC ATTGACAAGG GCCAGGATGA 
GCATOAGAAG TACCACTCCA ACTGGAGGGC TATGGCCTCT GACTTCAACC TGCCCCCTGT 
GGTGGCTAAG GAGATTGTGG CCTCCTGTGA CAAGTGCCAG CTGAAGGGGG AGGCCATGCA 
TQGGCAGGTG GACTGCTCCC CTGGCATCTG GCAGCTGGAC TGCACCCACC TGGAGGGCAA 
GGTGATCCTG GTGGCTGTGC ATGTGGCCTC CGGCTACATT 6AGGCTGA66 TGATCCCTGC 
TGAGACAGGC CAGGAGACTG CCTACTTCCT GCT6AAGCTG GCTGGCAGGT GGCCTGTGAA 
GACCATCCAC ACTGACAATG GCTCCAACTT CACTGGGGCC ACAGTGAGGG CTGCCTGCTG 
6TGGGCTGGC ATCAAGCAGG AGTTTGGCAT CCCCTACAAC CCCCAGTCCC AGGGGGTGGT 
GGAGTCCATG AACAAGGAGC TGAAGAAGAT CATTGGGCAG GTGAGGGACC AGGCTGAGCA 
CCTGAAGACA GCTGTGCAGA TGGCTGTGTT CATCCACAAC TTCAAGAGGA AGGGGGGCAT 
CGGGGGCTAC TCCGCTGGG6 AGAG6ATTGT GGACATCATT GCCACAGACA TCCAGACCAA 
GGAGCTCCAG AAGCA6ATCA CCAAGATCCA GAACTTCAGG 6TGTACTACA GGGACTCCAG 
6AACCCCCTG TGGAAGGGCC CTGCCAAGCT GCTGTGGAAG GGGGAGGGGG CTGTGGTGAT 
CCAGGACAAC TCTGACATCA AGGTGGTGCC CAGGAGGAAG GCCAAGATCA TCAGGGACTA 
TGGCAAGCAG ATGGCTGGGG ATGACTGTGT GGCCTCCAGG CAGGATGAGG ACTAAAGCCC 
GGGCAGATCT (SEQ ID NO: 5). 

The open reading ftame of the wild type tPA-pol construct disclosed as SEQ 
E) NO:5 contains 875 amino acids, disclosed herein as SEQ ID N0:6. as follows: 

Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 



66 



wo 02/22080 



PCT/USOl/28861 





Ala 


Val 


Phe 


Val Ser Pro Ser 




Glu 




Val 


Pro Val Lvs Leu 






Gin 


Tm 


Pro Leu Thr Glu 




Cvs 

w jr » 


Thr 


Glu 


Met Glu Lvs Glu 


5 




Pro 


Tvr 


Asn Thr Pro Val 




IjVS 


Tim 


Arcr 


Lvs Leu Val Aso 




AST) 


Phe 


Tro 


Glu Val Gin Leu 




-l-J jr 0 


IjVS 


IjVS 


Ser Val Thr Val 




Val 

V Trt ill 


Pro 


Leu 


AsD Glu Asp Phe 


10 


Ser 

1 


lie 

^ J* x^ 


Asn 


Asn Glu Thr Pro 




Pro 


Gin 


Gly 


Trp Lys Gly Ser 






lie 


Leu 


Glu Pro Phe Arcr 




Gin 


Tvr 


Met 


Asp Asp Leu Tvr 




His 


Ara 


Thr 


Lvs He Glu Glu 


15 




Thr 


Thr 


Pro Asp Lvs Lvs 




Met 


Glv 


Tvr 


Glu Leu His Pro 




Tjeu 


Pro 

f ^ x^ 


Glu 


Lvs Asp Ser Trp 




Glv 


TiVS 


lieu 


Asn Tro Ala Ser 




Gin 




Cvs 


Lvs Leu Leu Ar<7 


20 


* ^ w 


Lieu 

J-J W M 


Thr 


Glu Glu Ala Glu 






TjVS 


Glu 


Pro Val His Glv 




Tie 


Ala 


Glu 


He Gin Lvs Gin 




Jr 


Gin 


Glu 


Pro Phe Lvs Asn 




TVttci 


Glv 


Ala 


His Thr Asn Asp 


25 




lie 

^ *J- 


Thr 


Thr Glu Ser He 








Pro 


He Gin Lvs Glu 




TTTi 


Gin 


Ala 


Thr Trp He Pro 






Val 


Lvs 


Leu Trp Tvr Gin 




Glu 


Thr 


Phe 


Tyr Val Asp Gly 


30 


Lys 


Ala 


Gly 


Tyr Val Thr Asn 




Thr 


Asp 


Thr 


Thr Asn Gin Lys 




Iteu 


Gin 


Asp 


Ser Gly Leu Glu 




Ala 


lieu 


Gly 


He He Gin Ala 




Val 


Asn 


Gin 


He He Glu Gin 


35 


Ala 


Trp 


Val 


Pro Ala His Lys 




Lys 


Leu 


Val 


Ser Ala Gly He 



Glu He Ser Ala Pro He Ser Pro He 
Lys Pro Gly Met Asp Gly Pro Lys Val 
Glu Lys He Lys Ala Leu Val Glu He 
Gly Lys He Ser Lys He Gly Pro Glu 
Phe Ala He Lys Lys Lys Asp Ser Thr 
Phe Arg Glu Leu Asn Lys Arg Thr Gin 
Gly He Pro His Pro Ala Gly Leu Lys 
Leu Asp Val Gly Asp Ala Tyr Phe Ser 
Arg Lys Tyr Thr Ala Phe Thr He Pro 
Gly He Arg Tyr Gin Tyr Asn Val Leu 
Pro Ala He Phe Gin Ser Ser Met Thr 
Lys Gin Asn Pro Asp He Val He Tyr 
Val Gly Ser Asp Leu Glu He Gly Gin 
Leu Arg Gin His Leu Leu Arg Trp Gly 
His Gin Lys Glu Pro Pro Phe Leu Trp 
Asp Lys Trp Thr Val Gin Pro He Val 
Thr Val Asn Asp He Gin Lys Leu Val 
Gin He Tyr Pro Gly He Lys Val Arg 
Gly Thr Lys Ala Leu Thr Glu Val He 
Leu Glu Leu Ala Glu Asn Arg Glu He 
Val Tyr Tyr Asp Pro Ser Lys Asp Leu 
Gly Gin Gly Gin Trp Thr Tyr Gin He 
Leu Lys Thr Gly Lys Tyr Ala Arg Met 
Val Lys Gin Leu Thr Glu Ala Val Gin 
Val He Trp Gly Lys Thr Pro Lys Phe 
Thr Trp Glu Thr Trp Trp Thr Glu Tyr 
Glu Trp Glu Phe Val Asn Thr Pro Pro 
Leu Glu Lys Glu Pro He Val Gly Ala 
Ala Ala Asn Arg Glu Thr Lys Leu Gly 
Arg Gly Arg Gin Lys Val Val Thr Leu 
Thr. Glu. Leu Gin Ala He Tyr Leu Ala 
Val Asn He Val Thr Asp Ser Gin Tyr 
Gin Pro Asp Gin Ser Glu Ser Glu Leu 
Leu He Lys Lys Glu Lys Val Tyr Leu 
Gly He Gly Gly Asn Glu Gin Val Asp 
Arg Lys Val Leu Phe Leu Asp Gly He 
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ASP Lys Ala Gin Asp Glu His Glu Lys Tyr His Ser Asn Trp Arg Ala 
Met Ala Ser Asp Phe Asn Leu Pro Pro Val Val Ala Lys Glu He Val 
Ala ser' Cys Asp Lys Cys Gin Leu Lys Gly Glu Ala Met His Gly Gin 
val ASP Cys Ser Pro Gly He Trp Gin Leu Asp Cys Thr His Leu Glu 
Gly Lys val He Leu Val Ala Val His Val Ala Ser Gly Tyr He Glu 
Ala Glu val He Pro Ala Glu Thr Gly Gin Glu Thr Ala Tyr Phe Leu 
Leu Lys Leu Ala Gly Arg Trp Pro Val Lys Thr He His Thr Asp Asn 
Gly ser Asn Phe Thr Gly Ala Thr Val Arg Ala Ala Cys Trp Trp Ala 
Gly He Lys Gin Glu Phe Gly He Pro Tyr Asn Pro Gin Ser Gin Gly 
val val Glu ser Met Asn Lys Glu Leu Lys Lys He He Gly Gin Val 
Arg ASP Gin Ala Glu His Leu Lys Thr Ala Val Gin Met Ala Val Phe 
He His Asn Phe Lys Arg Lys Gly Gly He Gly Gly tyr Ser Ala Gly 
Glu Arg He Val Asp He He Ala Thr Asp He Gin Thr Lys Glu Leu 
Gin Lys Gin He Thr Lys He Gin Asn Phe Arg Val Tyr Tyr Arg Asp 
ser Arg Asn Pro Leu Trp Lys Gly Pro Ala Lys Leu Leu Trp Lys Gly 
Glu Gly Ala val Val He Gin Asp Asn Ser Asp He Lys Val Val Pro 
- Arg Arg Lys Ala-Lys He He Arg Asp Tyr-Gly-tys Gin Met Ala Gly- - - 
Asp Asp cys val Ala Ser Arg Gin Asp Glu Asp {SEQ ID NO: 6) . 

The present invention also relates to a codon optimized HIV-1 Pol mutant 
20 contained within a recombinant adenoviral vector such as lA-Pol (SEQ ID N0:4) 
which comprises a leader peptide at the amino tenninal portion of the protein, which 
may effect cellular trafficking and hence, immunogenicity of the expressed protem 
within the host ceU. Any such adenoviral-based HIV-1 DNA pol mutant disclosed m 
the above paragraphs is suitable for fusion downstream of a leader peptide, such as a 
25 leader peptide including but not limited to the human tPA leader sequence. Therefore, 
any such leader peptide-based HTV-l pol mutant construct may include but is not 
limited to a mutated DNA molecule which effectively alters the catalytic activity of 
the RT RNase and/or IN region of the expressed protein, resulting in at least 
substailtially decreased enzymatic activity one or more of the RT, RNase H and/or IN 
30 functions of mv-l Pol. In a preferred embodiment of this portion of the mvention. a 
leader peptideMIV4 DNA pol construct contains a mutation or-mutations within the 
Pol coding region which effectively abolishes RT, RNase H and IN activity. An 
especially preferable HIV-1 DNA pol constmct is a DNA molecule which contains at 
least one point mutation which alters the active site and catalytic activity withm the 
RT RNase H and IN domains of Pol, such that each activity is at least substanually 
abolished, and preferably totally aboUshed. Such a HIV-1 Pol mutant will most hkely 
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comprise at least one point mutation in or around each catalytic donoain responsible for 
RT, RNase H and IN activity, respectfully. An especially preferred embodiment of 
this portion of the invention relates to a human tPA leader fused to the lA-Pol protein 
comprising the nine mutations shown in Table 1. The DNA molecule is disclosed 
5 herein as SEQ ID N0:7 and the expressed tPA-IA Pol protein comprises a fusion 
junction as shown in Figure 18. The complete amino acid sequence of (he expressed 
protein is set forth in SEQ ID N0:8. To this end, SEQ ID NO:7 discloses the 
nucleotide sequence which codes for a human tPA leader fused to the lA Pol protein 
comprising the nine mutations shown in Table 1 (herein, "tPA-opt-IApol"). The open 
10 reading frame begins with the initiating Met (nucleotides 8-10) and terminates with a 
"TAA" codon at nucleotides 2633-2635. The nucleotide sequence encoding tPA- 
lAPol is also disclosed as follows: 

GATCACCATG GATGCAATGA AGAGAGGGCT CTGCTGTGTG CTGCTGCTGT GTGGAGCAGT 
CTTCGTTTCG CCCAGCGAGA TCTCCGCCCC CATCTCCCCC ATTGAGACTG TGCCTGTGAA 

15 GCT6AAGCCT GGCATGGAT6 GCCCCAAG6T GAAGCAGTGG CCCCTGACT6 AGGAGAAGAT 
CAAGGCCCTG GTGGAAATCT GCACTGAGAT GGAGAAGGAG GGCAAAATCT CCAAGATTGG 
CCCCGAGAAC CCCTACAACA CCCCTGTGTT TGCCATCAAG AAGAAGGACT CCACCAAGTG 
GAGGAAGCTG GTGGACTTCA GGGAGCTGAA CAAGAGGACC CAGGACTTCT GGGAGGTGCA 
GCTGGGCATC CCCCACCCCG CTGGCCTGAA GAAGAAGAAG TCTGTGACTG TGCTGGCTGT 

20 GGGGGATGCC TACTTCTCTG TGCCCCTGGA TGAGGACTTC AGGAAGTACA CTGCCTTCAC 
CATCCCCTCC ATCAACAATG AGACCCCTGG CATCAGGTAC. CAGTACAATG TGCTGCCCCA 
GGGCTGGAAG GGCTCCCCTG CCATCTTCCA GTCCTCCATG ACCAAGATCC TGGAGCCCTT 
CAGGAAGCAG AACCCTGACA TTGTGATCTA CCAGTACATG GCTGCCCTGT ATGTGGGCTC 
TGACCTGGAG ATTGGGCAGC ACAGGACCAA GATTGAGGAG CTGAGGCAGC ACCTGCTGAG 

25 GTGGGGCCTG ACCACCCCTG ACAAGAAGCA CCAGAAGGAG CCCCCCTTCC TGTGGATGGG 
CTATGAGCTG CACCCCGACA AGTGGACTGT GCAGCCCATT GTGCTGCCTG AGAAGGACTC 
CTGGACTGTG AATGACATCC AGAAGCTGGT GGGCAAGCTG AACTGGGCCT CCCAAATCTA 
CCXrrGGCATC AAGGTGAGGC AGCTGTGCAA GCTGCTGAGG GGCACCAAGG CCCTGACTGA 
GGTGATCCCC CTGACTGAGG AGGCTGAGCT G6AGCTGGCT GAGAACAGGG AGATCCTGAA 

30 GGAGCCTGTG CATGGGGTGT ACTATGACCC CTCCAAGGAC CTGATTGCTG AGAOTCCAGAA 
•6CAGGGCCA6 GGCCAGT6GA CCTACCAAAT CTACCA6GAG CCCTTCAAGA ACCTGAAGAC 
TGGCAAGTAT GCCAGGATGA GGGGGGCCCA CACCAATGAT GTGAAGCAGC TGACTGAGGC 
TGTGCAGAAG ATCACCACTG AGTCCATTGT GATCTGGGGC AAGACCCCCA AGTTCAAGCT 
GCCCATCCAG AAGGAGACCT GGGAGACCTG GTGGACTGAG TACTGGCAGG CCACCTGGAT 

35 CCCTGAGTGG GAGTTTGTGA ACACCCCCCC CCTGGTGAAG CTGTGGTACC AGCTGGA6AA 
GGAGCCCATT GTGGGGGCTG AGACCTTCTA TGTGGCTGGG GCTGCCAACA GGGAGACCAA 
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GCTGGGCAAG GCTGGCTATG TGACCAACAG GGGCAGGCAG AAGGTGGT6A CCCT6ACTGA 
CACCACCAAC CAGAAGACTG CCCTCCAGGC CATCTACCTG GCCCTCCAGG ACTCTCGCCT 
GGAGGTGAAC ATTGTGACTG CCTCCCAGTA TGCCCTG6GC ATCATCCAGG CCCAGCCT6A 
TCAGTCTGAG TCTGAGCTGG TGAACCAGAT CATTGAGCAG CTGATCAAGA AGGAGAAGGT 
6TACCTGGCC TGGGTGCCTG CCCACAAGGG CATTGGGGGC AAT6AGCAGG TGGACAAGCT 
GGTGTCTGCT GGCATCAGGA AGGTGCTGTT CCTGGATGGC ATTGACAAGG CCCAGGATGA 
GCATGAGAAG TACCACTCCA ACTGGAGG6C TATGGCCTCT GACTTCAACC TGCCCCCTGT 
GGTGGCTAAG GACSATTGTGG CCTCCTGTGA CAAGTGCCAG CTGAAGGGGG AGGCCATGCA 
TGGGCAGGTG GACTGCTCCC CTGGCATCTG GCAGCTGGCC TGCACCCACC . TGGAGGGCAA 
GGTGATCCTG GTGGCTGTGC AT6TGGCCTC CGGCTACATT 6AGGCTGAGG. T6ATCCCTGC 
T6AGACAGGC CAGGAGACTG CCTACTTCCT GCT6AAGCTG GCTGGCAGGT GGCCTGT6AA 
GACCATCCAC ACTGCCAATG GCTCCAACTT CACTGGGGCC ACAGTGAGGG CTGCCTGCTG 
GTGGGCTGGC ATCAAGCAGG AGTTTGGCAT CCCCTACAAC CCCCAGTCCC AGGGGGTGGT 
GGCCTCCATG AACAAGGAGC TGAAGAAGAT CATTGGGCAG GTGAGGGACC AGGCTGAGCA 
CCTCAAGACA GCTGTCCAGA TQGCTGTCTT CATCCACAAC TTCAAGAGGA AGGGGGGCAT 
C6GGGGCTAC TCCGCTGGGG AGAG6ATTGT GGACATCATT GCCACA6ACA TCCAGACCAA 
GGAGCTCCAG AAGCAGATCA CGAA6ATCCA-6AACTTCAGG-GT6TAGTACA- GGGACTCCAG - 
GAACCCCCTG TGGAAGGGCC CTCCCAAGCT GCTGTGGAAG GGGGAGGGGG CTGTGGTGAT 
CCAGGACAAC TCTGACATCA AGGTG6TGCC CAGGAGGAAG GCCAAGATCA TCAGGGACTA 
TGGCAAGCAG ATGGCTGGG6 A-reACTGTGT GGCCTCCAGG CAGGATGAGG ACTAAAGCCC 
G6GCA6ATCT (SEQ ID NO: 7) . 

The open reading ftame of the tPA-IA-pol construct disclosed as SEQ E> 
N0:7 contains 875 amino acids, disclosed herein as tPA-IA-Pol and SEQ ID N0:8. as 
follows: 

Met ASP Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
Ala val Phe Val Ser Pro Ser Glu He Ser Ala Pro He Ser Pro He 
Glu Thr Val Pro Val Lys Leu Lys Pro Gly Met Asp Gly Pro Lys Val 
Lys Gin Trp Pro Leu Thr Glu Glu Lys He Lys Ala Leu Val Glu He 
cys Thr Glu Met Glu Lys Glu Gly Lys He Ser Lys He Gly Pro Glu 
Asn Pro Tyr Asu Thr Pro Val Phe Ala He Lys Lys Lys Asp Ser Thr 
Lys Trp Arg Lys Leu Val Asp Phe Arg Glu Leu Asn Lys Arg Thr Gin 
ASP Phe Trp Glu Val Gin Leu Gly He Pro His Pro Ala Gly Leu Lys 
Lys Lys Lys Ser Val Thr Val Leu Ala Val Gly Asp Ala Tyr Phe Ser 
val Pro Leu Asp Glu Asp Phe Arg Lys Tyr Thr Ala Phe Thr He Pro 
ser He Asn Asn Glu Thr Pro Gly He Arg Tyr Gin Tyr Asn Val Leu 
Pro Gin Gly Trp Lys Gly Ser Pro Ala He Phe Gin Ser Ser Met Thr 



70 



wo 02/22080 



PCT/USOl/28861 





IjVS 


lie 

•lU lik W 


lieu 


Glu Pto Phe Ajccr 




Gin 






Ala Ala liSU Tvir 

XVXGi XXX6% XJ^U 




His 




Thr 

X XXX 


livs Tie Glu Glu 






^ mX 


Th-r 

X IIX 


Pi?o AsD Lvs Lvs 


5 

<mf 




Glv 


xyx 


Glu Leu His Piro 






XT i> W 


Glu 

V7X IX 


TjVS Asd Ser Trt) 








T«oii 


Aen Tna Ala Ser 

xikaxx XX ^ ^%xm wcx 








a 


TjVS Leu Ijeu Ajtct 








X XXX 


Glu Glu Ala Glu 


10 


iJXSXA, 


Ujr a 


Glu 


Pro Val His Glv 




lie 


Ala 


Glu 

^7X U 


lie Gin Lvs Gin 






Gin 


Glu 


Pro Phe Lvs Asn 






Glv 


Ala 


His Thr Asn Asd 




XJJr a 


Tie 


THif 

X XXX 


Thr Glu Ser He 


15 


TiVfi! 
XJjr 9 


XJ w U 


s X W 


Tie Glu Lvs Glu 






Gin 


Ala 


Thr Tm Tie Pro 

X XXX X X «kai»W C «• W 






Val 

V ox 


T,\ra 
xj y a 


Tjeu Tm Tvr Gin 




Rill 


Till* 


Phe 

f XXG 


Tvr Val Ala Glv 






Ala 


Glv 


Tvr Val Thr Asn 


20 




A.C5T1 


Thy 

X XXX 


Thr Asn Gin Lvs 






Gin 


AST) 


Ser Glv Leu Glu 




Ala 




Glv 


He He Gin Ala 




Val 


Asn 

x»axx 


Gin 

\9XXX 


He He Glu Gin 




Ala 


Tina 


Val 

V mX 


Pro Ala His Lvs 

C^^\^ «^«JU^& AJjw^ 


25 




U 


Val 


Ser Ala Glv He 

^^^^^b 4»A^^W* ^^tflffj^ m/mmm\^ 




A fin 


TiVfi 
jj a 


Ala 


Gin AsD Glu His 




Met 


Ala 


Seir 


Asn Phe Asn Leu 




Ala 




Pv<? 


Asn Lvs Cvs Gin 




Val 


^ - — 

Asp 


Cys 


Ser Pro Gly xie 


30 


Gly 


Lys 


Val 


He Leu Val Ala 




Ala 


Glu 


Val 


He Pro Ala Glu 




Leu 


Lys 


Leu 


Ala Gly Arg Trp 




Gly 


Ser 


Asn 


Phe Thr Gly Ala 




Gly 


lie 


Lys 


Gin Glu Phe Gly 


35 


Val 


val 


Ala 


Ser Met Asn Lys 




Arg 


Asp 


Gin 


Ala Glu His Leu 



Lys Gin Asn Pro Asp He Val He Tyr 
Val Gly Seir Asp Leu Glu He Gly Gin 
Leu Arg Gin His Leu Leu Arg Trp Gly 
His Gin Lys Glu Pro Pro Phe Leu Trp 
Asp Lys Trp Thr Val Gin Pro He Val 
Thr Val Asn Asp He Gin Lys Leu Val 
Gin He Tyr Pro Gly He Lys Val Arg 
Gly Thr Lys Ala Leu Thr Glu Val He 
Leu Glu Leu Ala Glu Asn Arg Glu He 
Val Tyr Tyr Asp Pro Ser Lys Asp Leu 
Gly Gin Gly Gin Trp Thr Tyz Gin He 
Leu Lys Thr Gly Lys Tyr Ala Arg Met 
Val Lys Gin Leu Thr Glu Ala Val Gin 
Val He Trp Gly Lys Thr Pro Lys Phe 
Thr Trp Glu Thr Trp T2:p Thr Glu Tyr 
Glu. Trp Glu Phe Val Asn Thr Pro Pro 
Leu Glu Lys Glu Pro He Val Gly Ala 
Ala Ala Asn Arg Glu Thr Lys Leu Gly 
Arg Gly Arg Gin Lys Val Val Thr Leu 
Thr Ala Leu Gin Ala He Tyr Leu Ala 
Val Asn He Val Thr Ala Ser Gin Tyr 
Gin Pro Asp Gin Ser Glu Ser Glu Leu 
Leu He Lys Lys Glu Lys Val Tyr Leu 
Gly He Gly Gly Asn Glu Gin Val Asp 
Arg Lys Val Leu Phe Leu Asp Gly He 
Glu Lys Tyr His Ser Asn Trp Arg Ala 
Pro Pro Val Val Ala Lys Glu He Val 
Leu Lys Gly Glu Ala Met His Gly Gin 
Trp Gin Leu Ala Cys Thr His Leu Glu 
Val His Val Ala Ser Gly Tyr He Glu 
Thr Gly Gin Glu Thr Ala Tyr Phe. Leu 
Pro Val Lys Thr He His Thr Ala Asn 
Thr Val Arg Ala Ala Cys Trp Trp Ala 
He Pro Tyr Asn Pro Gin Ser Gin Gly 
Glu Leu Lys Lys He He Gly Gin Val 
Lys Thr Ala Val Gin Met Ala Val Phe 
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He His Asn Phe Lys Arg Lys Gly Gly He Gly Gly Tyr Ser Ala Gly 
Glu Arg He Val Asp He He Ala Thr Asp He Gin Thr Lys Glu I.eu 
Gin Lys Gin He Thr Lys He Gin Asn Phe Arg Val Tyr Tyr Arg Asp 
Ser Arg Asn Pro Leu Trp Lys Gly Pro Ala Lys Leu Leu Trp Lys Gly 
5 Glu Gly Ala Val Val He Gin Asp Asn Ser Asp He Lys Val Val Pro 
Arg Arg Lys Ala Lys He He Arg Asp Tyr Gly Lys Gin Met Ala Gly 
Asp Asp Cys Val Ala Ser Arg Gin Asp Glu Asp (SBQ ID N0:8) . 

EXAMPLE 18 

10 CODON OPTIMIZED HIV-1 NEF AND CODON OPTIMIZED 

mV-l NEF MODIFICATIONS 
Codon optimized version of HIV-1 Nef and HIV-1 Nef modifications are 
essentially as described in U.S. Application Serial No. 09/738,782, filed 
December 15, 2000 and PCT International AppUcation PCr/USOO/34162, also filed 

15 December 15, 2000, both documents which are hereby incorporated by reference. As 
disclosed within the above-mentioned documents, particular embodiments of codon 

optimized Nef and Nef modifications relate to a DNA molecule encoding-HIV-1- Nef— 

bom the HIV-1 jfrl isolate wherein the codons are optimized for expression in a 
mammalian system such as a human. The DNA molecule which encodes this protein 

20 is disclosed herein as SEQ ID N0:9, while the expressed open reading fiame is 
disclosed herein as SEQ ID NO: 10. Another embodiment of Nef-based coding 
regions for use in the adenoviral vectors of the present invention comprise a codon 
optimized DNA molecule encoding a protein containing the human plasminogen 
activator (tpa) leader peptide fused with the NHa-terminus of the HIV-1 Nef 

25 polypeptide. The DNA molecule which encodes this protein is disclosed herein as 
SEQ ID NO: 1 1, while the expressed open reading frame is disclosed herein as SEQ 
ID NO: 12. Another modified Nef optimized coding region relates to a DNA molecule 
encoding optimized HIV-1 Nef wherein the open reading fma& codes for 
modifications at the amino terminal myristylation site (Gly-2 to Ala-2) and 

30 substitution of the Leu-174-Leu-175 dileucine motif to Ala-174-Ala-175, herein 

described as opt nef (G2A, LLAA). The DNA molecule which encodes this protem is 
disclosed herein as SEQ ID NO: 13, while the expressed open reading firame is 
disclosed herein as SEQ ID N0:14. An additional embodiment relates to a DNA 
molecule encoding optimized HIV-1 Nef wherein the amino terminal myristylation 

35 site and dileucine motif have heea deleted, as wdl as comprising a tPA leader peptide. 
This DNA molecule, opt tpanef (IXAA), comprises an open reading frame which 
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encodes a Nef protein containing a tPA leader sequence fused to amino acid residue 
6-216 of HIV-1 Nef (jfirl), wherein Leu-174 and Leu-175 are substituted with Ala-174 
and Ala-175, herein referred to as opt tpanef (LLAA) is disclosed herein as SEQ ID 
NO: 15, while the expressed open reading frame is disclosed herein as SEQ ID NO: 16. 
5 As disclosed in the above-identified documents (U.S. Application Serial No. 

09/738.782 and PCT International Application PCT/USOO/34162) and reiterated 
herein, the following nef-based nucleotide and amino acid sequences which comprise 
the respective open reading frame are as follows: 

1 . The nucleotide sequence of the codon optimized version of HIV-1 jrfl 
10 nef gene is disclosed herein as SEQ ID NO:9, as shown herein: 

GATCTGCCAC CATGGGCGGC AAGTGGTCCA AGAGGTCCGT GCCCGGCTGG TCCACCGTGA 
GGGAGAGGAT GAGGAGGGCC GAGCCCGCCG CCGACAGGGT GAGGAGGACC GAGCCCGCCG 
CCGTGGGCGT GGGCGCCGTG TCCAGGGACC TGGAGAAGCA CGGCGCCATC ACCTCCTCCA 
ACACCGCCGC CACCAACGCC GACTGCGCCT GGCTGGAGGC CCAGGAGGAC GAGGAGGTGG 

15 6CTTCCCCGT GAGGCCCCAG GTGCCCCTGA GGCCCAT6AC CTACAAGGGC GCCGO^GGACC 
TGTCCCACTT CCT6AAGGAG AAGGGCGGCC TGGAGGGCCT GATCCACTCC CAGAAGAGGC 
AGGACATCCT GGACCTGTGG GTGTACCACA CCCAGGGCTA CTTCCCCGAC TGGCAGAACT 
ACACCCCCGG CCCCGGCATC AGGTTCCCCC TGACCTTCGG CTGGTGCTTC AAGCTGGTGC 
CCGTGGAGCC CGAGAAGGTG GAGGAGGCCA ACGAGGGCGA GAACAACTGC CTGCTGCACC 

20 CCATGTCCCA GCACGGCATC GAGGACCCCG AGAAGGAGGT GCTGGAGTGG AGGTTCGACT 
CCAAGCTGGC CTTCCACCAC GTGGCCAGGG AGCTGCACCC CGAGTACTAC AAGGACTGCT 
AAAGCCCGGG C (SEQ ID N0:9) . 

Preferred codon usage is as follows: Met (ATG), Gly (GGC), Lys (AAG), 
Tip (TGG), Ser (TCC), Arg (AGG), Val (GTG), Pro (CCC), Thr (ACC), Glu (GAG); 

25 Leu (CTG), Bis (CAC), He (ATC), Asn (AAC), Cys (TGC), Ala (GCC). Gin (GAG), 
Phe (TTC) and Tyr (TAG). For an additional discussion relating to mammalian 
(human) codon optimization, see WO 97/31115 (PCT/US97/02294), which is hereby 
incorporated by reference. See also Figure 19A-B for a comparion of wild type vs. 
codon optimized nucleotides comprising the open reading frame of HIV-Nef . 

30 The open reading frame for SEQ ID N0:9 above comprises an initiating 

methionine residue at nucleotides 12-14 and a 'TAA" stop codon from nucleotides 
660-662. The open reading frame of SEQ ID N0:9 provides for a 216 amino acid 
HTV-l Nef protein expressed through utilization of a codon optimized DNA vaccine 
vector. The 216 amino acid HIV-1 Nef Qfxl) protein is disclosed herein as SEQ ID 

35 NO: 10, and as follows: 

Met Gly Gly Lys Trp Ser Lys Arg Ser Val Pro Gly Trp Ser Thr Val 
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Arg 61u Arg Met Arg Arg Ala Glu Pro Ala Ala Asp Arg Val Arg Arg 
Thr Glu pro Ala Ala Val Gly Val Gly Ala Val Ser Arg Asp Leu Glu 
Lys His Gly Ala He Thr Ser Ser Asn Thr Ala Ala Thr Asn Ala Asp 
Cys Ala Trp Leu Glu Ala Gin Glu Asp Glu Glu Val Gly Phe Pro Val 

5 Arg Pro Gin Val Pro Leu Arg Pro Met Thr Tyr Lys Gly Ala Val Asp 
Leu Ser His Phe Leu Lys Glu Lys Gly Gly Leu Glu Gly Leu He His 
Ser Gin Lys Arg Gin Asp He Leu Asp Leu Trp Val Tyr His Thr Gin 
Gly Tyr Phe Pro Asp Trp Gin Asn Tyr Thr Pro Gly Pro Gly He Arg 
Phe Pro Leu Thr Phe Gly TrP Cys Phe Lys Leu Val Pro Val Glu Pro 

10 Glu Lys Val Glu Glu Ala Asn Glu Gly Glu Asn Asn Cys Leu Leu His 
Pro Met Ser Gin His Gly He Glu Asp Pro Glu Lys Glu Val Leu Glu 
Trp Arg Phe Asp Ser Lys Leu Ala Phe His His Val Ala Arg Glu Leu 
His Pro Glu Tyr Tyr Lys Asp Cys (SEQ ID NO:10) . 

HIV-1 Nef is a 216 amino add cytosolic protein which associates with the 

15 inner surface of the host cell plasma membrane through myiistylation of Gly-2 

(Eranchini et al., 1986. Virology 155: 593-599). While not aU possible Nef functions 
have been elucidated, it has become clear that conect trafficking of Nef to the inner _ 
plasma membrane promotes viral replication by altering the host intracellular 
environment to facilitate the early phase of the fflV-1 Ufe cycle and by increasing the 

20 infectivity of progeny viral particles. In one aspect of the invention regarding 
codon-optimized, protein-modified polypeptides, the nef-encoding region of the 
adenovirus vector of the present invention is modified to contain a nucleotide 
sequence which encodes a heterologous leader peptide such that the amino terminal 
region of the expressed protein will contain the leader peptide. The diversity of 

25 function that typifies eukaryotic cells depends upon the stiBctural differentiation of 
tiieir membrane boundaries. To generate and maintain tiiese structures, proteins must 
be transported from dieir site of synthesis in the endoplasmic reticulum to 
predetermined destinations throughout the ceU. This requires that the trafficking 
proteins display sorting signals that are recognized by the molecular machinery 

30 responsible for route selection located at the access points to tiie main tiafficWng 
patiiways. Sorting decisions for most proteins need to be made only once as tiiey 
traverse their biosynttietic pathways since their final destination, the cellular location 
at which they paform thdr function, becomes their permanent residence. 
Maintenance of intraceUular integrity depends in part on the selective sorting and 

35 accurate transport of proteins to their correct destinations. Defined sequence motifs 
ejdst in proteins which can act as 'address labels'. A number of sorting signals have 
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been found associated with the cytoplasmic domains of membrane proteins. An 
effective induction of CTL responses often required sustained, high level endogenous 
expression of an antigen. As membrane-association via myristylation is an essential 
requirement for most of Nef s function, mutants lacking myristylation, by glycine-to- 
5 alanine change, change of the dileucine motif and/or by substitution with a tpa leader 
sequence as described herein, will be functionally defective, and therefore will have 
improved safety profile compared to wild-type Nef for use as an HIV-1 vaccine 
component 

In another embodiment of this portion of the invention, either the DNA vector 

10 or the HTV-l nef nucleotide sequence is modified to include the human tissue-specific 
plasminogen activator (tPA) leader. As shown in Figure 16A-B, a DNA vector may 
be modified by known recombinant DNA methodology to contain a leader signal 
peptide of interest, such that downstream cloning of the modified HIV-1 protein of 
interest results in a nucleotide sequence which encodes a modified HIV-1 tPA/Nef 

IS protein. In the alternative, as noted above, insertion of a nucleotide sequence which 
encodes a leader peptide may be inserted into a DNA vector housing die open reading 
fiame for the Nef protein of inteiest. Regardless of the cloning strategy, the end result 
is a polynucleotide vaccine which comprises vector components for effective gene 
expression in conjunction with nucleotide sequences which encode a modified HTV-l 

20 Nef protein of interest, including but not limited to a HIV-1 Nef protein which 

contains a leader peptide. The amino acid sequence of the human tPA leader utilized 
herein is as follows: MDAMKRGLCCVLULCGAVFVSPSEISS (SEQ ID NO: 17). 

It has been shown that myristylation of Oly-2 in conjunction with a dileucine 
motif in the carboxy region of the protein is essential for Nef-induced down 

25 regulation of CD4 (Aiken et al., 1994, Cell 76: 853-864) via endocytosis. It has also 
been shown that Nef expression promotes down regulation of MHCI (Schwartz et al., 
1996, Nature Medicine 2(3): 338-342) via endocytosis. Hie present invention relates 
in part to DNA vaccines which encode modified Nef proteins altered in trafficking 
and/or functional properties. The modifications introduced into the adenoviral vector 

30 HTV vaccines of the present invention include but are not limited to additions, 
deletions or substitutions to the nef open reading frame which results in the 
expression of a modified Nef protein which includes an amino terminal lead^ 
peptide, modification or deletion of the amino terminal myristylation site, and 
modification or deletion of the dileucine motif within the Nef protein and which alter 

35 function within the infected host cell. Therefore, a central theme of the DNA 

molecules and recombinant adenoviral HTV vaccines of the present invention is (1) 
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host administxation and intraceUular delivery of a codon optimized nef-based 
adenoviral HIV vaccine; (2) expression of a modified Nef protein which is 
immunogenic in terms of eUciting both CTL and Th responses; and, (3) inhibiting or 
at least altering known early viral functions of Nef which have been shown to 

5 promote HIV-1 replication and load within an infected host. "Dierefore, the nef 

coding region may be altered, resulting in a DNA vaccine which expresses a modified 
Nef protein wherein the amino taminal Gly-2 myristylation residue is either deleted 
or modified to express alternate amino acid residues. Also, tiie nef coding region may 
be altered so as to result in a DNA vaccine which expresses a modified Nef protein 

10 wherein the dileucme motif is either deleted or modified to express alternate amino 
acid residues. In addition, the adenoviral vector HIV vaccines of tiie present 
invention also relate to an isolated DNA molecule, regardless of codon usage, which 
expresses a wild type or modified Nef protein as described herein, including but not 
limited to modified Nef proteins which comprise a deletion or substitution of Gly 2, a 

15 deletion or substitiition of Leu 174 and Leu 175 and/or inclusion of a leader sequence. 
Therefore, specific Nef-based constructs further include flie following, as 

- exemplification's and not limitations. - For example.-tiie present invention relates to an - 
adenoviral vector vaccine which encodes modified forms of HTV-l, an open reading 
frame which encodes a Nef protem which comprises a tPA leader sequence fused to 

20 amino acid residue 6-216 of HIV-1 Nef Qfrl) is referred to herein as opt tpanef. The 
nucleotide sequence comprising the open reading frame of opt tpanef is disclosed 
herein as SEQ ID NO:l 1, as shown below: 

CATGGATGCA ATGAAGAGAG GGCTCTGCT6 T6TGCTGCTG CTGTGTGGAG CAGTCTTCGT 
TTCGCCCAGC GAGATCTCCT CCAAGAGGTC CGTGCCCGGC TGGTCCACCG TGAGGGAGAG 

25 GATGAGGAGG GCCGAGCCCG CCGCC6ACA6 GGTGAGGAGG ACCGAGCCCG CCGCCGTGGG 
C6TGGGCGCC GTGTCCAGGG ACCTGGAGAA GCACGGCGCC ATCACCTCCT CCAACACCGC 
CGCCACCAAC GCCGACTGCG CCTGGCTGGA GGCCCAGGAG GACGAGGAGG TGGGCTTCCC 
CGTGAGGCCC CAGCTGCTCC TGAGGCCCAT GACCTACAA6 GGCGCCGTGG ACCTGTCCCA 
CTTCCTGAAG GAGAAGGGCG GCCTGGAGGG CCTGATCCAC TCCCAGAAGA GGCAGGACAT 

30 CCTGGACCTG TGGGTGTACC ACACCCAGG6 CTXCTTCCCC GACTQGCAGA ACTACACCCC 
CGGCCCCGGC ATCAGGTTCC CCCTGACCTT CGGCTGGTGC TTCAAGCTGG- TGCCCGTGGA 
GCCCGAGAAG GTGGAfiGAGG CCAACGAGGG CGAGAACAAC TGCCTGCTGC ACCCCATGTC 
CCAGCACGGC ATCGAGGACC CCGAGAAGGA GGTGCTGGAG TGGAGGTTCG ACTCCAAGCT 
GGCCTTCCAC CACGTGGCCA GGGftfiCTGCA CCCCGAGTAC TACAAGGACT GCTAAAGCC 

35 (SEQ ID NO: 11) . 

The open reading fi^e for SEQ ED NO: 11 comprises an initiating mefliionine 
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residue at nucleotides 2-4 and a *TAA'* stop codon from nucleotides 713-715. The 
open reading frame of SEQ ID N0:3 provides for a 237 amino acid HIV-1 Nef 
protein which comprises a tPA leader sequence fused to amino acids 6-216 of HIV-1 
Nef, including the dileucine motif at amino acid residues 174 and 175. This 237 
5 amino acid tPA/Nef Qfrl) fusion protein is disclosed herein as SEQ ID NO: 12. and is 
shown as follows: 







Asp Ala Met 


Lys 


Arg 


Gly 


Leu Cys Cys Val Leu Leu 


Leu Cys Gly 






\Tsil DVto T7al 
V ax ifuS V O.J. 








Glu He Ser Ser Lvs Arcr 


Ser Val Pro 




Gly 


Trp Ser Thr 


Val 


Arg 


Glu 


Arg Met Arg Arg Ala Glu 


Pro Ala Ala 


10 


Asp 


Arg Val Arg 


Arg 


Thr 


Glu 


Pro Ala Ala Val Gly Val 


Gly Ala Val 




Ser 


Arg Asp Leu 


Glu 


Lys 


His 


Gly Ala He Thr Ser Ser 


Asn Thr Ala 




Ala 


Thr Asn Ala 


Asp 


Cys 


Ala 


Trp Leu Glu Ala Gin Glu 


Asp Glu Glu 




Val 


Gly Phe Pro 


Val 


Arg 


Pro 


Gin Val Pro Leu Arg Pro 


Met Thr Tyr 




Lys 


Gly Ala Val 


Asp 


Leu 


Ser 


His Phe Leu Lys Glu Lys 


Gly Gly Leu 


15 


Glu 


Gly Leu lie 


His 


Ser 


Gin 


Lys Arg Gin Asp He Leu 


Asp Leu Trp 




Val 


Tyr His Thr 


Gin 


Gly 


Tyr 


Phe Pro Asp Tzp Gin Asn 


T^rr Thr Pro 




Gly 


Pro Gly He 


Arg 


Phe 


Pro 


Leu Thr Phe Gly Trp Cys 


Phe Lys Leu 




Val 


Pro Val Glu 


Pro 


Glu 


Lys 


Val Glu Glu Ala Asn Glu 


Gly Glu Asn 




Asn 


Cys Leu Leu 


His 


Pro 


Met 


Ser Gin His Gly He Glu 


Asp Pro Glu 


20 


Lys 


Glu Val Leu 


Glu 


Trp 


Arg 


Phe Asp Ser Lys Leu Ala 


Phe His His 




Val 


Ala Arg Glu 


Leu 


His 


Pro 


Glu Tyr Tyr Lys Asp Cys 


(SEQ ID NO: 12) 



Therefore, this exemplified Nef protein, Opt tPA-Nef , contains both a tPA leader 
sequence as well as deleting the myristylation site of Gly-2A DNA molecule encoding 
HEV-l Nef from the HIV-1 jfrl isolate wherein the codons are optimized for 

25 expression in a mammalian system such as a human. 

In another specific embodiment of the present invention, a DNA molecule is 
disclosed which encodes optimized HTV-l Nef wherein the open reading frame of a 
recombinant adenoviral HIV vaccine encodes for modifications at the amino terminal 
myristylation site (Gly-2 to Ala-2) and substitution of the Leu-174-Leu-175 dileucine 

30 motif to Ala-174-Ala-175. This open reading frame is herein described as opt nef 
(G2AJLLAA) and is disclosed as SEQ ID NO: 13, which comprises an initiating 
methionine residue at nucleotides 12-14 and a 'TAA" stop codon from nucleotides 
660-662. The nucleotide sequence of this codon optimized version of HTV-l jrfl nef 
gene with the above mentioned modifications is disclosed herein as SEQ ID NO: 13, 

35 as follows: 
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GATCTGCCAC CATGGCCGGC AAGTGGTCCA AGAGGTCCGT GCCCGGCT6G TCCACCGTGA 
GGGAGAGGAT GAGGAGGGCC GAGCCCGCC6 CC6ACAGGGT GAGGAGGACC GAGCCC6CCG 
CCGTGGGCGT GGGCGCCGTG TCCAGGGACC TGGAGAAGCA CGGCGCCATC ACCTCCTCCA 
ACACCGCCGC CACCAACGCC GACTGCGCCT GGCTG6AGGC CCAGGAGGAC GAGGAGGTGG 
GCTTCCCCGT GAGGCCCCAG GTGCCCCTGA GGCCCATGAC CTACAAGGGC GCCGTGGACC 
TGTCCCACTT CCTGAAGGAG AAGGGCGGCC TGGAGG6CCT GATCCACTCC CAGAAGAfiGC 
AGGACATCCT GGACCTGTGG GTGTACCACA CCCAGGGCTA CTTCCCCGAC TGGCAGAACT 
ACACCCCCGG CCCCGGCATC AGGTTCCCCC T6ACCTTCGG CTGGTGCTTC AAGCTGGTGC 
CCGTGGAGCC CGA6AAGGTG GAGGAGGCCA ACGAGGGCGA GAACAACTGC GCCGCCCACC 
CCATGTCCCA GCACGGCATC 6AGGACCCC6 AGAAG6AGGT GCTGGAGTGG AGGTTCGACT 
CCAAGCTGGC CTTCCACCAC GTGGCCAGGG AGCTGCACCC CGAGTACTAC AAGGACTGCT 
AAAGCCCGGG C (SEQ ID NO: 13) . 

The open reading frame of SEQ ID NO: 13 encodes Nef (G2A,IXAA), 
disclosed herein as SEQ ID NO: 14, as follows: 

Met Ala Gly Lys Trp Ser Lys Arg Ser Val Pro Gly Trp Ser Thr Val 
Arg Glu Arg Met Arg Arg Ala Glu Pro Ala Ala Asp Arg Val Arg Arg 
Thr Glu Pro Ala Ala Val Gly Val Gly Ala Val Ser Arg Asp Leu Glu - 
Lys His Gly Ala He Thr Ser Ser Asn Thr Ala Ala Thr Asn Ala Asp 
Cys Ala Trp Leu Glu Ala Gin Glu Asp Glu Glu Val Gly Phe Pro Val 
Arg Pro Gin Val Pro Leu Arg Pro Met Thr Tyr Lys Gly Ala Val Asp 
Leu Ser His Phe Leu Lys Glu Lys Gly Gly Leu Glu Gly Leu He His 
Ser Gin Lys Arg Gin Asp He Leu Asp Leu Trp Val Tyr His Thr Gin 
Gly Tyr Phe Pro Asp Trp Gin Asn Tyr Thr Pro Gly Pro Gly He Arg 
Phe Pro Leu Thr Phe Gly Trp Cys Phe Lys Leu Val Pro Val Glu Pro 
Glu Lys Val Glu Glu Ala Asn Glu Gly Glu Asn Asn Cys Ala Ala His 
Pro Met Ser Gin His Gly He Glu Asp Pro Glu Lys Glu Val Leu Glu 
Trp Arg Phe Asp Ser Lys Leu Ala Phe His His Val Ala Arg Glu Leu 
His Pro Glu Tyr Tyr Lys Asp Cys Ser (SEQ ID N0:14) . 

An additional embodiment of the present invention relates to another DNA 
molecule encoding optimized HIV-1 Nef wherein the amino terminal myristylation 
site and dileucine motif have been deleted, «s well as comprising a tPA leader peptide. 
This DNA molecule, opt tpanef (LLAA) comprises an open reading frame which 
encodes a Nef protein containing a tPA leader sequence fiised to amino add residue 
6-216 of HIV-1 Nef (jfrl). wherein Leu-174 and Leu-175 are substituted with Ala-174 
and Ala-175 (Ala-195 and Ala- 196 in this tPA-based fusion protein). The nucleotide 
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sequence comprising the open reading frame of opt tpanef (LLAA) is disclosed herein 
as SEQ ID NO: IS, as shown below: 



CATGGATGCA 


ATGAAGAGAG 


GGCTCTGCTG 


TGTGCTGCTG 


CTGTGTGGAG 


CAGTCTTCGT 


TTCGCCCAGC 


GAGATCTCCT 


CCAAGAGGTC 


CGTGCCCGGC 


TGGTCCACCG 


TGAGGGAGAG 


GAT6AGGAGG 


GCCGAGCCCG 


CCGCCGACAG 


GGTGAGGAGG 


ACCGAGCCCG 


CCGCCGTGGG 


CGTGGGCGCC 


GTGTCCAGGG 


ACCTGGAGAA 


GCACGGCGCC 


ATCACCTCCT 


CCAACACCGC 


CGCCACCAAC 


GCCGACTGCG 


CCTGGCTGGA 


GGCCCAGGA6 


GACGA6GAGG 


TGGGCTTCCC 


CGTGAGGCCC 


CAGGTGCCCC 


TGAGGCCCAT 


GACCTACAAG 


GGCGCCGTGG 


ACCTGTCCCA 


CTTCCTGAAG 


GAGAAGGGCG 


GCCTGGAGGG 


CCTGATCCAC 


TCCCAGAAGA 


GGCAGGACAT 


CCTGGACCTG 


TGGGTGTACC 


ACACCCAGG6 


CTACTTCCCC 


GACTGGCAGA 


ACTACACCCC 


CGGCCCCGGC 


ATCAGGTTCC 


CCCTGACCTT 


CGGCTGGTGC 


TTCAAGCTGG 


TGCCCGTGGA 


6CCCGAGAAG 


GTGGAGGAGG 


CCAACGAGGG 


CGAGAACAAC 


TGCGCCGCCC 


ACCCCATGTC 


CCAGCACGGC 


ATCGAGGACC 


CCGAGAAGGA 


6GTGCTGGA6 


TGGAGGTTCG 


ACTCCAAGCT 


GGCCTTCCAC 


CACGTGGCCA 


GGGAGCTGCA 


CCCCGAGTAC 


TACAAGGACT 


GCTAAAGCCC 



15 (SEQ ID NO: 15) . 

The open reading frame of SEQ ID NO:7 encoding tPA-Nef (LLAA), 
disclosed herein as SEQ ID NO: 16, is as follows: 





Met 


Asp 


Ala 


Met 


Lys 


Arg 


Gly 


Leu 


Cys 


Cys Val 


Leu 


Leu 


Leu 


Cys 


Gly . 




Ala 


Val 


Phe 


Val 


Ser 


Pro 


Ser 


Glu 


He 


Ser Ser 


Lys 


Arg 


Ser 


Val 


Pro 


20 


Gly 


Trp 


Ser 


Thr 


Val 


Arg 


Glu 


Arg 


Met 


Arg Arg 


Ala 


Glu 


Pro 


Ala 


Ala 




Asp 


Arg 


Val 


Arg 


Arg 


Thr 


Glu 


Pro 


Ala 


Ala Val 


Gly 


Val 


Gly 


Ala 


Val 




Ser 


Arg 


Asp 


Leu 


Glu 


Lys 


His 


Gly 


Ala 


He Thr 


Ser 


Ser 


Asn 


Thr 


Ala 




Ala 


Thr 


Asn 


Ala 


Asp 


Cys 


Ala 


Trp 


Leu 


Glu Ala 


Gin 


Glu 


Asp 


Glu 


Glu 




Val 


Gly 


Phe 


Pro 


Val 


Arg 


Pro 


Gin 


Val 


Pro Leu 


Arg 


Pro 


Met 


Tlir 


Tyr 


25 


Lys 


Gly 


Ala 


Val 


Asp 


Leu 


Ser 


His 


Phe 


Leu Lys 


Glu 


Lys 


Gly 


Gly 


Leu 




Glu 


Gly 


Leu 


lie 


His 


Ser 


Gin 


Lys 


Arg 


Gin Asp 


He 


Leu 


Asp 


Leu 


Trp 




Val 


Tyr 


His 


Thr 


Gin 


Gly 


Tyr 


Phe 


Pro 


Asp Trp 


Gin 


Asn 


Tyr 


Thr 


Pro 




Gly 


Pro 


Gly 


lie 


Arg 


Phe 


Pro 


Leu 


Thr 


Phe Gly 


Trp 


Cys 


Phe 


Lys 


Leu 




Val 


Pro 


Val 


Glu 


Pro 


Glu 


Lys 


Val 


Glu 


Glu Ala 


Asn 


Glu 


Gly 


Glu 


Asn 


30 


Asn 


Cys 


Ala 


Ala 


His 


Pro 


Met 


Ser 


Gin 


His Gly 


He 


Glu 


Asp 


Pro 


Glu 




Lys 


Glu 


Val 


Leu 


Glu 


Trp 


Arg 


Phe 


Asp 


Ser Lys 


Leu 


Ala 


Phe 


His 


His 




Val 


Ala 


Arg 


Glu 


Leu 


His 


Pro 


Glu 


Tyr 


Tyr Lys 


Asp 


Cys 


(SEQ ID 


N0:16) . 



An adenoviral vector of the present invention may comprise a DNA sequence, 
regardless of codon usage, which expresses a wild type or modified Nef protein as 
35 described herein, including but not limited to modified Nef proteins which comprise a 
deletion or substitution of Gly 2, a deletion of substitution of Leu 174 and Leu 175 
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and/or inclusion of a leader sequence. Iherefore, partial or fiiUy codon optimized 
DNA vaccine expression vector constructs are preferred since such constructs should 
result in increased host expression. However, it is within the scope of the present 
invention to utilize "non-codon optimized" versions of flie constructs disclosed heiein, 

5 especially modified versions of HIV Nef which are shown to promote a substantial 
cellular immune response subsequent to host administration. 

Figure 20A-C show nucleotide sequences at junctions between nef coding 
sequence and plasmid backbone of nef expression vectors VlJns/nef (Figure 20A), 
VlJns/nef(G2A,LLAA) (Figure 20B). VlJnsApanef (Figure 20C) and 

10 VlJns/tpanefOLLAA) (Figure 20C, also). 5' and 3' flanking sequences of codon 

optimized nef or codon optimized nef mutant genes arc indicated by boldAtalic letters; 
nef and nef mutant coding sequences are indicated by plain letters. Also indicated (as 
underlined) are the restriction endonuclease sites involved in constmction of 
respective nef expression vectors. VlJns/tpanef andVlJns/tpanef(LLAA)have 

15 identical sequences at the jimctions. 

Figure 21 shows a schematic presentation of nef and nef derivatives. Amino 
acid residues involved m Nef derivatives are.presented.Glycine.2 and Uudnel7-4 _ 

and 175 are the sites involved in myristylation and dileucine motif, respectively. 

2Q EXAMPLE 19 

MRKAdSPol Construction and Virus Rescue 
Construction of vector: shutOe plasmid and pre-adenovirus plasmid - Key 
steps performed in ttie construction of the vectors, including tiie pre-adenovirus 
plasmid denoted MRKAd5pol, is depicted in Figure 22. Briefly, the adenoviral shutde 
25 vector for the full-length mactivated HIV-1 pol gene is as foUows. The vector 

MRKpdelEl(Pac/pIX/pack450)+CMVnun+BGHpA(str.) is a derivative of tiie shuttie 
vector used in tiie construction of tiie MRKAdSgag adenoviral pre-plasmid. The 
vector contains an expression cassette witii tiie hCMV promoter (no intronA) and tiie 
bovine growtii hormone polyadenylation signal. The expression unit has been 
30 inserted into flie shuttie vector such tiiat insertion of tiie gene of choice at a unique 
Bgm site wiU ensure tiie direction of transcription of tiie transgene will be Ad5 El 
parallel when inserted into tiie MRKpAd5(El.AE3+)aal (or MRKpAdHVES) pre- 
plasmid. The vector, similar to ttie original shuttie vector contains tiie Pacl site, 
extension to tiie packaging signal region, and extension to tiie pK gene. The syntiietic 
35 full-lengtii codon-optimized HTV-l pol gene was isolated directiy from tiie plasmid 
pVlJns-HIV-pol-inact(opt). Digestion of tiiis plasmid witii Bgl R releases tiie pol 
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gene intact (comprising a codon optimized lA pol sequence as disclosed in SEQ ID 
NO:3). The pol fragment was gel purifiied and ligated into the 
MRKpdelEl(Pac/pIX/pack450)+CMVmin+BGHpA(str.) shuttle vector at the BgHL 
site. The clones were checked for the correct orientation of the gene by using 
5 lestriction enzymes DrdHUNoth A positive clone was isolated and named 

MRKpdel+hCMVmin+EL-pol+bGHpA(s). The genetic structure of this plasmid was 
verified by PGR, restriction enzyme and DNA sequencing. Hie pre-adenovirus 
plasmid was constructed as follows. Shuttle plasmid MRKpdel+hCMVmin+FL- 
pol+bGHpA(S) was digested with restriction enzymes Pad and BstllOl I (or its 

10 isoschizonxer, 5^^107 T) and then co-transformed into E. coli strain BJ5183 with 
linearized (Clal digested) adenoviral backbone plasmid, MRKpAd(El-/E3+)ClaL 
The resulting pre-plasmid originally named MRKpAd+hCMVmin+FE^ 
pol+bGB^A(S)E3+ is now referred to as **pMRKAd5por'. The genetic structure of 
the resulting pMRKAdSpol was verified by PGR, restriction enzyme and DNA 

15 sequence analysis. The vectors were transformed into competent £1 coli XL-l Blue 
for preparative production. The recovered plasmid was verified by restriction enzyme 
digestion and DNA sequence analysis, and by expression of the pol transgene in 
transient transfection cell culture. The complete nucleotide sequence of this 
pMRKAd5HIV-lpol adenoviral vector is shown in Figure 26 A-AO. 

20 Generation of research-grade recombinant adenovirus - The pre-adenovirus 

plasmid, pMRKAdSpol, was rescued as infectious virions in PER.G6® adherent 
monolayer cell culture. To rescue infectious virus, 12 jig of pMRKAdSpol was 
digested with restriction enzyme Pacl (New England Biolabs) and 3.3 jig was 
transfected per 6 cm dish of PER.G6® cells using the calcium phosphate co- 

25 precipitation technique (Cell Phect Transfection Kit, Amersham Pharmacia Biotech 
Inc.). Pacl digestion releases tfie viral genome from plasmid sequences allowing viral 
replication to occur after entry into PER.G6®cells. Infected cells and media were 
harvested 6-10 days post-transfection, after complete viral cytopathic effect (GPE) 
was observed. Infected cells and media were stored at ^ -60°C. This pol containing 

30 recombinant adenovirus is referred to herein as "MRKAdSpol". This recombinant 
adenovirus expresses an inactivated HIV-1 Pol protein as shown in SEQ ID NO:6. 

EXAMPLE 20 
MRKAdSNef Construction and Virus Rescue 
35 Construction of vector: shuttle plasmid and pre-adenovirus plasmid • Key 

steps performed in the construction of the vectors, including the pre-adenovirus 
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plasmid denoted MRKAdSnef, is depicted in Figure 23. Briefly, as shown in 
Example 19 above, the vector 

MBlKpdeffil(Pac/pIX/pack450)+CMVinin+BGHpA(str.) is the shuttle vector used in 
the construction of the MRKAdSgag adenoviral pre-plasmid. It has been modified to 
contain the Pacl site, extension to the packaging signal region, and extension to the 
pIX gene. It contains an expression cassette with the hCMV promoter (no intionA) 
and the bovine growth hormone polyadenylation signal. The expression unit has been 
inserted into the shuttle vector such that insertion of the gene of choice at a unique 
5^/1 1 site will ensure the direction of transcription of die transgene will be Ad5 El 
parallel when inserted mto the MRKpAd5(El-/E3+)Clal pre-plasmid. The synflietic 
fuU-length codon-optimized HIV-1 nef gene was isolated directty ftom the plasmid 
pVlJns/nef (G2A,LLAA). Digestion of diis plasmid with 5^/11 releases the pol gene 
mtact, which comprises the nucleotide sequence as disclosed in SEQ ID NO: 13. The 
nef fragment was gel purified and ligated into the 

MRKpdeIEl+CMVmin+BGHpA(str.) shutQe vector at the Bglll site. The clones 
were checked for correction orientation of the gene by usmg restriction enzyme Seal. 
- A positive clone was isolated and named MRKpdelElhCMVminFL-nefBGHpA(s).- — 
The genetic struchire of this plasmid was verified by PGR, restriction enzyme and 
DNA sequencing. The pre-adenovirus plasmid was constructed as follows. Shutde 
plasmid MRKpdelElhCMVminFL-nefBGHpA(s) was digested with restriction 
enzymes Pacl and Bstlltn I (or its isoschizomer, 5^^107 1) and then co-ti^formed 
into E. coli stirain BJ5183 with linearized (Oal digested) adenoviral backbone 
plasmid, MRKpAd(El/E3+)aal. The resulting pre-plasmid originally named 
MRKpdelElhCMVminEL-nefBGHpACs) is now refened to as "pMRKAdSnef . The 
genetic stmcture of tiie resulting pMRKAdSnef was verified by PGR. restriction 
enzyme and DNA sequence analysis. The vectors were transformed into competent E. 
coli XL-1 Blue for preparative production. The recovered plasmid was verified by 
restriction enzyme digestion and DNA sequence analysis, and by expression of the 
nef transgene in transient transfection ceU culture. The complete nucleotide sequence 
of this pMRKAdSmV-lnef adenoviral vector is shovm in Figure 27A-AM. 

Generation of research-grade recombinant adenovirus - The pre-adenovirus 
plasmid, pMRKAdSnef, was rescued as infectious virions in PER.C6® adherent 
monolayer cell culture. To rescue infectious virus, 12 ng of pMRKAdnef was 
digested with restriction enzyme Pacl (New England Biolabs) and 3.3 p-g was 
transfected per 6 cm dish of PER.C6® ceUs using the calcium phosphate co- 
precipitation technique (CeU Phect Transfection Kit, Amersham Pharmacia Biotech 
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Inc.). Pacl digestion releases the viral genome from plasmid sequences allowing 
viral replication to occur after entry into PER.C6®cells. Infected cells and media were 
harvested 6-10 days post-transfection, after complete viral cytopathic effect (CPE) 
was observed. Mected cells and media were stored at < -60°C. This nef containing 
5 recombinant adenovirus is now referred to as "MRKAdSnef 

EXAMPLE 21 

Construction of Murine CMV Promoter Containing Shuttle Vectors 

for Inactivated Pol and Ne£^G2A,LLAA 

10 The murine CMV (mCMV) was amplified from the plasmid pMH4 (supplied 

by Frank Graham, McMaster University) using the primer set: mCMV (Not I) 
Forward: 5'-ATA AGA ATGCGGCCGCCA TAT ACT GAG TCA TTA GG-3* 
(SEQ ID NO: 20); mCMV {Bgl II)Rev©rse: 5'-AAG G AA GAT CT A CCG ACG 
CTG GTC GCG CCT C-3' (SEQ ID N0:21). The underlined nucleotides represent 

15 the Not I and the Bgl U sites respectively for each piimar. This PGR amplicon was 
used for the construction of the mCMV shuttle vector containing the transgene in the 
El parallel orientation. The hCMV promoter was removed from the original shuttle 
vector (containing the hCMV-gag-bGHpA transgene in the El parallel orientation) by 
digestion with Not I and Bgl n. The mCMV promoter (Not 11 Bgl M digested PCR 

20 product) was inserted into the shuttle vector in a directional manner. The shuttle 
vector was then digested with Bgl n and the gag reporter gene (Bgl II fragment) was 
re-inserted back into the shutde vector. Several clones were screened for correct 
orientation of the reporter gene. For the construction of the mCMV-gag in the El 
antiparallel orientation, the mCMV promoter was amplified from the plasmid pMH4 

25 using the following primer set: mCMV (Asc I) Forward: 5 - ATA AGA ATGGCG 
CGCCA T ATA CTG AGT CAT TAG G (SEQ ID NO:22); mCMV (Bgl II) Reverse: 
5' AAG GAAGATCTA CCG ACG CTG GTC GCG CCT C (SEQ ID NO:23). The 
underlined nucleotides represent the Asc I and Bgl n sites, respectively for each 
primer. The shutde vector containing the hCMV-gag transgene in the El antiparallel 

30 orientation was digested with Ascl and Bgll 1 to remove the hCMV-gag portion of the 
transgene. The mCMV promoter (AscllBglll digested PCR product) was inserted 
into the shutde vector in a directional manner. The vector was then digested with 
Bglll and the gag reporter gene (Bglll fragment) was re-inserted. Several clones 
were screened for correct orientation of the reporter gene. For each of the full length 

35 lA pol and fiill length nefi^G2A,LLAA genes, cloning was performed using the unique 
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Bgl n site within the mCMV-bGHpA shuttle vector. The pol and nef genes were 
excised ftom tbeir respective pVl Jns plasmids by Bgl H digestion. 

EXAMPLE 22 

5 Construction of mCMV Rill Length Inactivated Pol and 

Full Length ne^G2AJJLAA Adenovectors 
Each of these transgenes of Example 21 were inserted into the modified 
shutfle vector in both the El parallel and El anti-parallel orientations. Pacl and 
BsfZl 101 digestion of each shuttle vector was perfonned and each specific transgene 

10 fragment containing the flanking Ad5 sequences was isolated and co-transformed 
with Cla I digested MRKpAd5(B3+) or MRKpAd5(E3-) aderiovector plasmids via 
bacterial homologous recombination in BJ5183 E. coli cells. Recombinant pre- 
plasmid adenovectors containing the various transgenes in both the E3- and E3+ 
versions (and in the El parallel and El antiparallel orientations) were subsequentiy 

15 prepared in large scale following transformation into XL-1 Blue E. coli cells and 
analyzed by restriction analysis and sequencing. 



20 



EXAMPLE 23 
Construction of hCMV-tpa-nef (LLAA) Adenovector 
The tpa-nef gene was ampUfied out from GMP grade pVlJns-tpanef (LLAA) vector 
using the primer sets: Tpanef (BamHD F S'-ATT GGATCC ATG GAT GCA ATG 
AAG AGA GGG (SEQ ID 24); Tpanef (Bamffl) R 5'-ATA GQATCQ TTA GCA 



GTC err GTA GTA CTC G (SEQ ID NO:25). The resulting PGR product was 
digested with Bainm, gel purified and cloned into the Bgl H site of MRKAdSCMV- 

25 bGHpA shuttle vector {Bgl H digested and calf intestinal phosphatase treated). 

aones containing the tpanef CLLAA) gene (see SEQ ID NO: 15 for complet coding 
region) in the correct orientation with respect to the hCMV promoter were selected 
foUowing Sea I digestion. The resulting MRKAd5tpanef shuttle vector was digested 
with Pac I and Bst Zl 101 and cloned into the E3+ MRKAdS adenovector via bacterial 

30 homologous recombination techniques. 

EXAMPUE24 

Immunogenicity of MRKAdSpol and MRKAd5nef Vaccine 
Materials and Methods - Rodent Immunization - Groups of N=10 BALB/c 
35 mice were immunized i.m. with the following vectors: (1) MRKAd5hCMV-IApol 
(E3+) at either 10^7 vp and 10^9 vp; and (2) MRKAdShCMV-IApol (E3-) at either 
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10^7 vp and 10^9 vp. At 7 weeks post dose, 5 of the 10 mice per cohort were boosted 
with the same vector and dose they initially received At 3 weeks post the second 
does, sera and spleens were collected from aU the animals for RT EUS A and IFNg 
ELIspot analyses, respectively. For all rodent immunizations, the Ad5 vectors were 
5 diluted in 5 mM Tris, 5% sucrose, 75 mM NaCl, 1 mM MgC12, 0.005% polysorbate 
80, pH 8.0. The total dose was injected to both quadricep muscles in 50 pL aliquots 
using a 0.3-mL insulin syringe with 28-1/2G needles (Becton-Dickinson, Franklin 
Lakes, NJ). 

Groups of N=10 C57/BL6 mice ware immunized i.m. with the following 
10 vectors: (1) MRKAd5hCMV-nef(G2AJXAA) (E3+) at eidier 10^7 vp and 10^9 vp; 
(2) MRKAd5mCMV-nef(G2A,IJLAA) (E3+) at either 10^7 vp and 10^9 vp; and (3) 
MRKAd5ma\lV-tpanef(IXAA) (E3+) at either 10^7 vp and 10^9 vp. At 7 weeks 
post dose, 5 of die 10 mice per cohort were boosted with the same vector and dose 
they initially received. At 3 weeks post the second does, sera and spleens were 
15 collected horn all the animals for RT ELISA and IFNg ELIspot analyses, 
respectively. 

Non-human Primate immunization - Cohorts of 3 rfiesus macaques (2-3 kg) 
were vaccinated with the following Ad vectors: (1) MRKAd5hCMV-IApol (E3+) at 
either 10^9 vp and 10^11 vp dose; and (2) MRKAd5hCMV-IApol (E3-) at either 

20 10^9 vp and 10^1 1 vp; (3) MRKAd5hCMV-nef (G2A JLLAA) (E3+) at either 10^9 vp 
and 10^11 vp; and (4) MRKAd5m(3^-nef(G2AJXAA) (E3+) at either 10'^9 vp and 
10^11 vp. The vaccine was administered to chemically restrained monkeys (10 
mg/kg ketamine) by needle injection of two 0.5 mL aliquots of the Ad vectors (in 5 
mM Tris, 5% sucrose, 75 mM NaCl, 1 mM MgCla, 0.005% polysorbate 80, pH 8.0) 

25 into both deltoid muscles. The animals were immunized twice at a 4 week interval 
(T=0, 4 weeks). 

Murine anti-RTand anti-nef ELISA - Anti-RT titers were obtained following 
standard secondary antibody-based ELLS A. Maxisorp plates (NUNC, Rochester; NY) 
were coated by overnight incubation with 100 |iL of 1 Jig /mL HIV-1 RT protein 

30 (Advanced Biotechnologies, Columbia, MD) in PBS. For anti-nef ELISA, 100 uL of 1 
ug/mL HIV-1 nef (Advanced Biotechnologies, Columbia, MD) was used to coat the 
plates. The plates were washed with PBS/0.05% Tween 20 using Titertek MAP 
instrument (Hunstville, AL) and incubated for 2 h with 200 |Jl/well of blocking 
solution (PBS/0.05% tween/1% BSA). An initial serum dilution of 100-fold was 

35 performed followed by 4-fold serial dilution. lOO-jiL aliquots of serially diluted 
samples were added per well and incubated for 2 h at room temperature. The plates 
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were washed and 100 jiL of yiOOO-dUuted HRP-rabbit anti-mouse IgG (ZYMED. 
San Francisco, CA) were added with 1 h incubation. The plates were washed 
thoroughly and soaked with 100 piL 1,2-phenylenediamine dihydiochloiide/hydrogen 
peroxide (DAKO, Norway) solution for 15 min. The reaction was quenched by 

5 adding 100 nL of 0.5M H2SO4 per well. OD492 readings were recorded using Titertek 
Multiskan MCa340 with S20 stacker. Endpoint titers were defined as the highest 
serum dilution that resulted in an absorbance value of greater than or equal to 0.1 
OD492 (2.5 times tiie background value). 

Non-human primate and murine EUspot assays - The enzyme-linked 

10 immuno-spot (EUSpot) assay was utilized to enumerate antigen-specific INP/- 
secreting cells from mouse spleens (Miyahira, et al.l995, J. Immunol Methods 
181:45-54) or macaque PBMCs. Mouse spleens were pooled from 5 mice/cohort and 
single ceU suspensions were prepared at 5xlO'^/mL in complete RPMI media 
(RPMI1640, 10% FBS, 2nM L-glutamine, lOOU/mL Penicillin, 100 u/mL 

15 stoeptomycin. 10 mM Hepes, 50 uM P-ME). Rhesus PBMCs were prepared from 8- 
15 mL of heparinized blood following standard RcoU gradient separation (Coligan, et 
al, 1998, Current Protocols in Immunology. John Wiley & Sons, Inc.). Multiscreen 
opaque plates (MilUpore, France) were coated with 100 pIVwell of either 5 jig/mL 
purified rat anti-mouse IFN-y IgGl, clone R4-6A2 (Phamiingpn, San Diego, CA), or 

20 15 ug/mL mouse anti-human IFN-Y IgG2a (Cat. No. 1598-00, R&D Systems, 
MinneapoUs. MN) in PBS at 4°C overnight for murine or monkey assays, 
respectively. The plates were washed witii PBS/peniciDin/stiBptomydn and blocked 
witii 200 pUweU of complete RPMI media for 37 °C for at least 2 h. 

To each well, 50 nL of ceU samples (4-5x10* cells per well) and 50 ML of the 

25 antigen solution were added. To the control well, 50 pL of the media containing 

DMSO were added; for specific responses, either selected peptides or peptide pools (4 
ug/mL per peptide final concentiiation) were added. For B ALB/c mice immunized 
with the pol constructs, stimulation was conducted using a pool of CIM^-epitope 
containing 20-mer peptides (aa21-40, aa411-430, aa641-660, aa731-750. aa771-790) 

30 or a pool of CD8*-epitope containing peptides (aa201-220, aa311-330, aa781-800). 
For C57/BL6 mice immunized with the nef consttuct, either aa51-70 (CDS* T cell 
epitope) or aaSl-lOO (CD4-^ peptide derived from tiie nef sequence was added for 
specific stimulation. In monkeys, tfie responses against pol were evaluated using two 
pools (L and R) of 20-aa peptides that encompass the entire pol sequence and overlap 

35 by 10 amino acids. In monkeys vaccinated wifli the nef constrocts, a single pool 
containing 20-mer peptides covering tiie entire HIV-1 nef sequence and overlapping 
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by 10 aa was used. Each sample^antigen mixture was performed in triplicate wells for 
murine samples or in duplicate wells for rhesus PBMCs. Plates were incubated at 
ST'^C, 5% CO2, 90% humidity for 20-24 h. The plates were washed with PBS/0.05% 
Tween 20 and incubated with 100 pL/well of either 1.25 jig/mL biotin-conjugated rat 
5 anti-mouse IFN-y mAb, clone XMG1.2 (Phanningen) or of 0.1 ug/mL biotinylated 
anti-human IFN-gamma goat polyclonal antibody (R&D Systems) at 4®C overnight. 
The plates were washed and incubated with 100 (jlVwell 1/2500 dilution of 
strepavidin-aUcaline phosphatase conjugate (Pharmingen) in PBS/0.005% Tween/5% 
FBS for 30 min at 37 °C. Spots were developed by incubating with 100 pL/well 1- 

10 step NBT/BCIP (Pierce Chemicals) for 6-10 min. The plates were washed with water 
and allowed to air dry. The number of spots in each well was determined using a 
dissecting microscope and the data normalized to 10^ cell input. 

Non-human Primate anti-RTEUSA - The pol-specific antibodies in the 
monkeys were measured in a competitive RT EIA assay, wherein sample activity is 

15 detetmined by the ability to block RT antigen from binding to coating antibody on the 
plate well. Briefly, Maxisorp plates were coated with saturating amoxmts of pol 
positive human serum (#971 11234). 250 uL of each sample is incubated with 15 uL 
of 266 ng^mL RT recombinant protein (in RCM 563. 1% BSA, 0.1% tween, 0.1% 
NaNa) and 20 uL of lysis buffer (Coulter p24 antigen assay kit) for 15 min at room 

20 temperature. Similar mixtures are prepared using serially diluted samples of a 

standard and a negative control which defines maximum RT binding. 200 uL/weU of 
each sample and standard were added to the washed plate and the plate incubated 16- 
24 h at room temperature. Bound RT is quantified following the procedures described 
in Coulter p24 assay kit and reported in miUiMerck units per mL arbitrarily defined 

25 by the chosen standard. 

Results ' Rodent Studies - B ALB/c mice (n=5 mice/cohort) were immunized 
once or twice with varying doses of MRKAd5hCMV-lApolCE3+) and 
MRKAd5hCMV-IApol(E3-). At 3 weeks after the second dose, Anti-pol IgG levels 
were determined by an ELIS A assay using RT as a surrogate antigen. Cellular 

30 response were quantified via DPNy ELISpot assay against pools of pol-epitope 
containing peptides. The results of these assays are sununarized in Table 10. The 
results indicate that the mouse vaccinees exhibited detectable anti-RT IgGs with an 
adenovector dose as low as 10'^7 vp. The humoral responses are highly dose- 
dependent and are boostable with a second immunization. One or two doses of either 

35 pol vectors elicit high ftequencies of antigen-specific CD4'** and CDS**" T cells; the 
responses are weakly dose-dependent but are boostable with a second immunization. 
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Table 10. ImmunogeDicity of MRKAdSpol Vectors in BALB/c mice. 



GUroup 



Vaccine 



MRKAdShCMVFLpol (E34-) 



MRKAdShCMVFLpol (E3+) 



MRKAdShCMVFLpol (E3-) 



Dose 



10^7 vp 



i(y^ vp 



10^7 vp 



Antt-RT IqG Ttters^ 



No. of 
Doses 



GMT 



Z 
1 



310419 
819 



2 
1 



163840r 
713165 



2 
1 



310419 
6400 



+SE 



301785 
372 



0 

528520 



3B6218 
14013 



153020 
265 



0 

303555 



172097 
4393 



SFai0*6 cells* 



Medium 



1(1) 
1(1) 



2(2) 
1(1) 



0(0) 
10(8) 



C04+ 
peptide 

pool 



76{4) 
72(9) 



114(9) 
48(7) 



223(7) 
141^) 



CD8+ 
peptide 
JB3SL 



2313(67) 
533(41) 



2063(182) 
733(89} 



2S07(Z7) 
409(28) 



MRKAdShCMVFLpol (E3-) 



Naive 



10^ vp 



none 



2 

1 

nana 



1638400?' 
1241675^ 

"57 



0 

396725 

T 



0 

300661 



1(1) 
0(0) 



160(13) 
39(13) 



2385(11) 
833(83) 



•GMT. geometric mean titer oi the cohort of 5 mice; SE, stancfaird error of the gemelrfc mean 
*Near*or at the upper Dmtt of the serial dilution; hence, could be greater than this value 

»Na of Spot-forming Cells per mDBon splecnoytes; mean values of triplicates are reported along wHh slandaid errors In parenttiesis. 

C57/BL6 mice were immunized once or twice with varying doses of 
N4RKAd5hCMV-nef(G2AJXAA) (E3+), MRKAd5mCMV-nef(G2A,LLAA) (E3+) 
at either 10^7 vp and(3) MRKAd5mCMV-tpanef(LLAA) (E3+) at either 10^7 vp and 
10^9 vp. The immune response were analyzed using similar protocols and the results 
are listed in Table 1 It While anti-nef IgG responses could not be detected in this ~ 
model system with any of the constructs, there are strong indications of a cellular 
inmiunity generated against nef using the ELIspot assay. 



Table 11. Irmnunogenicity of MRKAdSnef Vectors in CS7/BL6 mice. 



Group 



Vaccbie 



No. of 
Doses 



Anfl-nef IgG Titers" 



GRIT 



SE 



SFC/IO^ cells' 



Medium 



a^WO 
CM* 



aa81>100 
CD4+ 



3 



MRKAdShCMVFUief (E3f) 



10«7vp 



2 
1 



174 
132 



70 
42 



50 
32 



1(1) 
0(0) 



MRKAdShCMVFLnef (Eav) 



10^ vp 



2 
1 



174 
132 



70 
42 



50 
32 



0(0) 
1(1) 



MRKAdSmCMVFLnef (E3+) 



10^7 vp 



2 
1 



132 

lis 



42 
46 



32 
33 



3(1) 



23(1) 
0(0) 



61(7) 
62(7) 



15(5) 

3(2) 



1(1) 
0(0) 



4(2) 
3(1) 



5(2) 
4(2) 



6 



MRKAdSfflCMS/FLnef (E3f) 



10^ vp 



2 
1 



132 
132 



42 
42 



32 
32 



4(2) 
2(1) 



MRKAd5mCMVIpanef(E3f) 



10^7 vp 



2 
1 



132 
100 



42 
0 



32 
0 



3(2) 
3(1) 



MRKAd5mCMVtpanef(E34>) 



10^ vp 



2 
1 



230 
115 



170 
46 



98 
33 



3(2) 
7(1) 



83(13) 
29(2) 



14(2) 
13(4) 



145(29) 
151(14) 



5(1) 
4(0) 



5(1) 
10(3) 



4(0) 
10(0) 



NaOfve 



none 



none 



152 



78 



52 



21^ 



GMT geometric mean tiler of the cohort of 5 mice; SB, standard error of the gemelric mean 
*»Na of spoMomiing cells per miffion splecnoytes; mean values of triplicates are reported along with standard 



18(6) 



enors in parenthesis. 



26(3) 



Monkey Studies - Cohorts of 3 rhesus macaques were immunized with 2 doses 
of MRKAd5hCMV-IApol(E3+) and MRKAd5hCMV-IApolCE3-). The number of 
antigen-spedfic T ceUs (per million PBMCs) were enumerated using one of two 
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peptide pools (L and R) that cover the entire pol sequence; the results are listed in 
Table 12. Moderate-to-strong T cell responses were detected in the vaccinees using 
either constructs even at a low dose of 10^9 vp. Longitudinal analyses of the anti-RT 
antibody titers in the animals suggest that the pol transgene product is expressed 
efficiently to elicit a humoral response (Table 13). It would appear that generally 
higher ioomune responses were observed in animals that received the E3- construct 
compared to the E3+ virus. 

Table 12. Pol-specific T Cell Responses in MRKAdSpol Immunized Rhesus 



Macaques, 



Vaccine (Ts0,4 wks) 


Monk# 


Preblee<l 




T=4 


T=7 




Ts16 








Mock 


PolL 


PdR 


Mock 


PolL 


PoIR 


Mock 


PolL 


PoIR 


Mock 


PolL 


POlR 




99a 00 


1 


0 


0 


1 


38 


31 


0 


52 


146 


0 


49 


715 


10^11 vp 


99C215 


1 


2 


2 


10 


98 


249 


1 


109 


305 


22 


68 


250 


99D201 


5 


5 


4 


6 


149 


95 


0 


40 


35 


0 


35 


18 


M?KAd5hCW-l/«Xl(^3+) 


99D212 


0 


2 


0 


4 


331 


114 


0 


58 


14 


0 


6 


6 




99D180 


0 


4 


2 


0 


19 


192 


4 


36 


156 


5 


38 


106 


. 99C201 


B 


5 


21 


6 


62 


62 


0 


18 


32 


1 


14 


65 


MRKAt£ha^-IApG((i3-> 


99D239 


5 


2 


2 


20 


82 


172 


1 


68 


114 


9 


21 


40 


lo^n VP 


99C186 


4 


12 


6 


5 


120 


421 


2 


271 


48d 


16 


875 


530 


99C084 


1 


8 


0 


8 


84 


464 


0 


14 


236 


1 


24 


264 


MRKAcfihCMV-IApclCpS') 


cac 


10 


10 


8 


12 


724 


745 


4 


322 


376 


A 


188 


176 


1(K9vp 


CD16 


2 


0 


1 


5 


474 


468 


0 


232 


212 


0 


101 


121 


CX>1I 


6 


6 


12 


10 


98 


110 


5 


60 


80 


8 


25 


34 


Ntfve 


0B3Q 


nd 


nd 


nd 


nd 


nd 


nd 


4 


2 


2 


2 


1 


2 



nd,noldatamilned 

Rsportsd are 8FC per RAlton PBMC^ mean d (tajpScait wdte. 



Table 13. Anti-RT Ig Levels in MRKAd5pol Immunized macaques. 



RT ANT IBODY >i£S AY T ITERS IN n«AUAnL 








Voodne^tonkev T aa 




T=7 


Ts12 


Ts16 


MRKAd5hCMV-IADOl(E3+). 10^11 VD 










99C100 


61 


1999 


5928 


4768 


99C216 


81 


1641 


2356 


2767 


99D201 


53 


336 


539 


387 












MR K AdShCMV-IADOlCE 3+). 1 0'^9 VD 










99D21 2 


10 


40 


49 


68 


99D180 


<10 


36 


79 


93 


99C201 


<10 


37 


71 


76 












MRKAdShCMV-IADOI(E3-). lO'^ll VD 










99D239 


44 


460 


1234 


1015 


99C186 


21 


233 


480 


345 


99CD84 


235 


2637 


2858 


1626 












MR KAd5hCMV-iADOl(E 3--). 1 0'^ VP 










0C7C 


32 


176 


306 


235 


CDIG 


20 


140 


273 


419 


con 


15 


112 


149 


237 
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When rhesus macaques were immunized i,m. with two doses of MKKAdSnef 
constructs, vigorous T ceU responses ranging from 100 to as high as 1100 per million 
were observed in 8 of 12 vaccinees (Table 14), The efficacies of the mCMV- and 
hCMV- driven nef constructs are comparable on the basis of the data generated thus 

far. 

Table 14. Nef-spedfic T cell Responses in MRKAdSnef Immunized Rhesus 




MRKAd5hCMV-nef(Q2A.LLA/^ (E3+) 
10^11 vp 



MBKAdShCMV-nef(Q2A,LLAA) (E3+) 
1(y« vp 



CC2K 
CD15 
CD16 



MRKAd5mCMV-nef (G2A,LLAA) (ESf ) 
10^11 vp 



MRKAd5mChflV-nef(G2A.lXAA) (E3fr)- 
10«9vp 



990191 
99D144 
99C193 

99D224. 

99D2S0 

990120 



Naive 



083Q 



9 
5 
6 



1 
4 
1 

T- 
e 
1 



nd 



9 
4 
1 



5 
6 
2 

_tl 
9 
6 



nd 



6 

30 

6 



4 
5 
1 

l4- 
4 
20 



16 



52 

998 

1146 



614 
434 
58 

108 
299 



22 



0 
2 
0 



0 
0 

1 

T 

0 
0 



35 
566 
369 



298 
1100 
22 . 

125 
54 
92 



0 
0 
1 



2 
2 
0 

"o 

0 
0 



15 
434 
212 



419 
932 
64 

to" 

5 

79 



EXAMPLE 25 

15 Comparison of Clade B vs. Clade C T Cell Responses in HIV-infected Subjects 
PBMC sam4>les collected from two dozens of patients infected with fflV-l in 
US were tested in EUSPOT assays with peptide pools of 20-mer peptides overlapping 
by 10 amino acids. Four different peptide pools were tested for cross-clade 
recognition, and they were either derived from a clade B-based isolate (gag H-b; nef- 

20 b) or a clade C-based isolate (gag H-c, nef-c). Data in Table 15 shows that T ceUs 
from these patients presumably infected with clade B HIV-1 could recognize clade C 
gag and nef antigens in ELISPOT assay. Correlation analysis further demonstrated 
that these T cell responses against clade C gag peptide pool were about 60% of the 
clade B counterpart (Figure 24), while tiie T ceU responses against clade C nef were . 

25 about 85% of die clade B counterpart (Figure 25). These results suggest tiiat cellular 
immune responses generated in patients infected witii clade B HIV-1 can recognize 
gag and nef antigens derived from clade C HIV-L These data show that a HIV 
vaccine, such as a DNA or MRKAd5-based adenoviral vaccine expressing a clade B 
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gag and/or nef antigen will potentially have the ability to provide a prophylactic 
and/or therapetic advantage on a global scale. 



5 Table 15 

Responses Shown as the Number of gBFN-Secreting T Cells per Million 



PBMCs 



subiect 


bleed date aag epitope # I mock 


gag H-b 


gagH-c 


nef-b 


nef-c 






from mapping) 










#100 


I9-Jul-99 


12 


10 


3950 


1385 


1295 


1300 


#101 


25-JUI-99 


3 


15 


3885 


1280 


na 


1020 


#102 


25-JUI-99 


4 


15 


1740 


850 


1255 


1785 


#104 


7-Jun-99 


2 


5 


1355 


1185 


na 


1060 


«107 


n-Oct-99 


2 


25 


3305 


2795 


670 


870 


#405 


n-Jul-99 


2 


15 


4575 


3180 


1700 


1500 


#501 


I9-JUI-99 


2 


15 


1100 


570 


3365 


3460 


#505 


18-JUI-99 


5 


10 


2145 


1725 


1235 


no 


«506 


28-Feb-99 


2 


25 


150 


45 


400 


610 


«701 


28-Mar-99 


5 


30 


7620 


4775 


3320 


2780 


#709 


17-Mciy-99 


3 


15 


2785 


1945 


1090 


1630 


#710 


24-May-99 


4 


5 


1055 


1080 


2210 


2140 



















10 EXAMPLE 26 

Characterization and Production of MRKAdSpol and MElKAdSnef 

Vectors in Roller Bottles 

Expansion of nef and pol Adenovectors - Nef and pol CsCl purified MRKAdS 
15 seeds were used to infect roller botties to produce P4 virus to be used as a seed for 
further experiments. P4 MRKAd5 pol and nef vectors were used to infect roller 
botties at an MOI 280 vp/cell, except for hCMV-tpa-nef [E3+] which was infected at 
an MOI of 125 due to low titers of seed obtained at P4. 

20 Table 16 Viral particle concentrations for P5 nef and pol adenovectors 



Adenovecjtor 


AEX Titer 
(10^° vp/ml culture) 


AEX Titer 
(iCvp/cell) 


Amplification 
Ratio 


hCMV-FL-nef [E3+] 


1.1 


0.9 


30 


mCMV-FL-nef [E3+] 


2.2 


2.1 


75 


hCMV-tpa-nef [E3+] 


0.07 


0.1 


5 


mCMV-tpa-nef [E3+] 


1.3 


0.9 


35 


hCMV-FL-pol IE3+] 


2.7 


2.1 


75 


hCMV-EL-pol [E3-] 


1.9 


1.3 


45 
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RoUer Bottle Passaging - Passj^ng of thepoZ and nef^constructs continued 
through passage seven. CeH-asscciated (fteeze/Aaw lysis) and whole broth (triton- 
lysis) titers obtained in all passages were very consistent In general, MRKAdSpoI is 
ca. 70% as productive as MREAdSgag while MRKAdSnef is ca. 25% as productive 
as MRJCAdSgag. Samples of P7 virus fa: both constructs were analyzed by V&CB by 
restriction digest analysis and did not show any rearrangements. 

Table 17. Pass age Six Viral Productivity for MRKAdS pol and MRKAdSnef 



hCMV-FL-nef [B34-1 pool 



lCMV-EUpollE3+-] pool 



Xviable(10* cells/m^, 
Viabifity(%) 



jnfeciiDii 



1.22, 85% 



Harvest 



0.99.62% 
1.10,72% 



1.22.70% 
1.42,74% 



Cell Passage 
Number 



AEX Titer 
(Cell Associalftft) 



62 



OS 




Ittar 
lO'vp/toell 



0.7 



Katio 



25 



Triton Lysis TitHr 
lO"* vpftnl culture 



15 Table 18. Passage Seven Viral ProductiTity for MRKA d5pol and MRKAdSnef 

Lj j.<iuic xw. , ^ . 1 r>_«n t m-ntm- I Titer I AmDlification 




20 



25 



MRKAdSnef and MRKAd5pol Viral ProducHan Kinetics - A timecourse 
experiment was carried out in roller bottles to detennine if the viral production 
kinetics of the MRKAdSpol and MRKAdSnef vectors were similar to those of 
MRKAdSgag. PER.C6® cells m loUer botfle cultures were infected at an MOI of 280 
vp/ceUs with P5 MRKAd5pol, PS MRKAdSnef and P7 MRKAdSgag; for each 
adenovector, two infected bottles were sanapled at 24, 36, 48, and 60 hours post 
infection. In addition, two bottles w©ce left unsampled until 48 hpi when they were 
harvested undca: the Phase I process conditions. The anion-exchange HPLC viral 
particle concentrations of the ficeeze-thaw recovered cell associated virus at the 24. 36. 
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48, and 60 hpi timepoints are shown in Figure 29A-B. The QPA titers show a similar 
trend (data not shown). 

Comparison ofhCMV- and mCMV-FLrnef- As the Uteis obtained with the 
5 MKKAd5ne£ construct QiCMV-FL-nef) were lower than those obtained with 
MRKAdSjgag or MRKAdSpol, a viral productivity comparison experiment was 
performed with mCMV-FL-nef . For each of the two adcnovcctors (hCMV- and 
mCMV-FL-neO, two roller bottles were infected at an MOI of 280 vp/cell with 
passage five clarified lysate. The macroscopic and microscopic observations of the 
10 four roller bottles were identical at the time of harvest. Analysis of the clarified iysate 
produced indicated a higher viral particle concentration in the bottles infected with 
mCMV-FL-nef, as shown in Table 19. It is stipulated that the higher productivit>' with 
mCMV promoter driven nef vector is due to lower nef expression levels in PER.C6 
cells- experiments are underway at V&CB to measure nef expression levels. 

15 

Table 19. Passage Six Viral Productivity Comparison of hCMV- and mCMV-FL-nef 







Xv ceUs/jDol), Viabifity (%) 


Cell Passage 


ABX Titer 


Tter 


Ampllficailoc 


Tdum Lysis Titer 






bficctioD 


Harvest 


Nusibec 


lO'^vp^oultin 




Rado 


lO'^vp/ml culture 


hCMV-PL-nef 


Pbol 


1.11,91% 




60 




1.4 


SO 


2.8 


(MRKAdSnef) 


1 

2 




1^,75% 

134,74% 












mCMV-FL-nef 


Pool 


Ul,91% 




60 


2,3 


2.1 


75 


4.6 




1 


















2 




1.18.77* 













20 EXAMPLE27 

Characterization and Large Scale Production of 
MRKAdSnef Virus in Bioreactors 
Materials and Methods - The experiment of the present example was run twice 
imder the following conditions: 36.5®C, DO 30%, pH 7.30, ISOrpm agitation rate, 
25 no sparging, life Technologies (Qibco, Invitrogen) 293 SFM n (with 6mM L- 

glutamine), 0.5M NaOH as base for pH control. During the first run (8200101 15). 
two lOL stirred vessel bioreactors were inoculated wifli PER.C6® cells at a 
concentration of 0.2x10^ cells/ml. Cells were grown until they reached a cell 
concaitration of approximately IxlO'' cells/ml. The cells were infected with uncloned 
30 MRKAdSnef (G2 A^LLAA) at a MOI of 280 virus particles (vp)/cell. For the second 
batch (B20010202), the same procedure as the first run was used, except the cells 
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10 



15 



20 



were infected with cloned MRAdSnef. During both runs, the bioieactors were 
harvested 48 hours post-uifection. Samples were taken and virus concentrations were 
detennined ftom whole broth (with tiiton lysis), supernatant, and cell peUets (3 X 
freeze/thaw) with the AEX and QPA assays. Metabolites were measured with 
BioProfile 250 throughout the process. 



Table 20: Experimmtal ConcMtions 




Table 21: Virus source used for experiments. 



Run 


Batch ID 


aonedAJncloned 
MRJKAdSnef 


MOl 
(vp/cells) 


#1 


B20010115-1 


Uncloned 


280 


B20010115-2 


Uncloned 


280 


#2 


B20010202-1 


Cloned 


280 


B20010202-2 


Cloned 


280 



Results " Table 22 and 23 show an the abiUty to scale up pioduction of 
MRKAdSnef by growth in a bioreactor. 

Table 22: Virus Concentration as measured by the AEX assay 



Vims Concentration @ 48hpi dxlO" vp^ 




Table 23: Virus Titers as measured by the QPA assay 




Run 



#1 



#2 



Batch ID 



B20010H5-1 



B20010115-2 



B20010202-1 



Qoned/Uncloned 
NdRKAdSnef 



Uncloned 



Uncloned 



Cloned 



Whole 
Broth 



0.13 



0.14 



0.14 



Supernatant 



1.12 



Clarified 

Lysate 



1.76 



0.73 



1.54 



0.97 



1.62 




Total 



Triton 
Lysate 



2.88 



11.28 



2.27 



5.86 



2.69 



11.89 



The present invention is not to be limited in scope by the specific 
embodiments described herein. Indeed, various modifications of the invention in 
addition to those described herein wUl become apparent to those skilled in the ait 
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from the foregoing description. Such modifications are intended to fall within the 
scope of the appended claims. 

EXAMPLE 28 

5 MRKAd5HIV-lgag Boosting of DNA-Primed Animals 

Groups of 3-5 rhesus macaques were immunized with (a) 5 mgs of VI Jns- 
Hgag (pVUnsCMV(no intron)-FL-gag-bGHpA), (b) 5 mgs of VlJns-Hgag 
formulated with 45 mgs of a non-ionic block copolymer CRL1005, or (c) 5 mgs of 

10 VlJns-Flgag formulated with 7.5 mgs of CRL1005 and 0.6 mM benzalkonium 
chloride at weeks 0, 4, and 8. All animals received a single dose of 10e7 viral 
particles (vp) of the MRKAdSHTV-lgag at week 26. Note: lOe? is too low to prime 
or boost effectively when used as a single modality (dose is selected to mimic 
preexposure to adenovirus); see Figure 32. 

15 Blood samples were collected from all animals at several time points and 

peripheral blood mononuclear cells (PBMCs) were prepared using standard FicoU 
method. The PBMCs were counted and analyzed for gamma-interferon secretion 
using the ELISpot assay (Table 24). For each monkey, the PBMCs were incubated 
overnight either in the absence (medium) or presence of a pool (called "gag H") of 50 

20 20-aa long peptides that encompass the entire HIV-1 gag sequence. 

The results indicate that MRKAd5HIV-lgag was very effective in boosting 
the T cell immune responses in these monkeys. At week 28 or 2 weeks after the viral 
boost, the number of gag-specific T cells per million PBMCs increased 2-48 fold 
compared to the levels observed at week 24 or 2 weeks prior to the boost. 

25 The PBMCs were also analyzed by intracellular gamma-interferon staining 

prior to (at week 10) and after the MElKAd5gag boost (at week 30). The results for 
select animals are shown on Figure 31. The results indicate that (a) inomunization 
with DNA/adjuvant fonnulation elicited T cell responses which can either be 
balanced, CD4'^-biased or CD8'*'-biased, and (b) boosting with the MRKAdSgag 

30 construct produced in all cases a strongly CD8'*"-biased response. These results 

suggest that boosting with MRKAd5HIV-lgag construct is able to improve the levels 
of antigen-specific CD8'*' T cells. 
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EXAMPLE 29 

Construction of gagpol fusion for MRKAdSgagpol fusion constructs 

The open reading frames for the codon-optimized HIV-1 gag gene was fused 
5 direcdy to the open reading frame of the lA pol gene (consistmg of RT, RNAseH and 
integrase domains) by stepwise PGR. Because the gene (SEQ ID NO: 38) does not 
include the protease gene and the frameshifl sequence, it encodes a single polypeptide 
of the combined size of p55. RT, RNAse H and integrase (1350 amino acids; SEQ E) 
NO: 39). 

10 The jfragment that extends from the BstEII site within the gag gene to the last 

non-stop codon was ligated via PGR to a fragment that extends from the start codon 
of the lApol to a unique BamHZ site. This fragment was digested with BstE// and 
BamHZ. Construction of gag-IApol fusion was achieved via three-fragment ligation 
involving the Pst/-BstEff gag digestion fragment, the BstEZZ/BamHZ digested PGR 

15 product and long PstZ/BamH/ VlR-FLpol backbone fragment. 

The MRKAdS-gs^pol adenovirus vector was constructed using the Bg}// 
fragment of the VlR-gagpol containing the entire ORE of gag-IApol fusion gene. 

EXAMPLE 30 

20 firmunogenicity Studies in Non-Human Primates 

Cohorts of three (3) macaques were immunized with lOeS or lOelO viral 
particles (vp) of one of the foUowmg MRKAdS HIV-1 vaccines: (1) MRKAdSgag; 
(2) MElKAd5pol; (3) MRKAd5nef ; (4) a mixture containing equal amounts of 
25 MRKAdSgag, MRKAdSpol, and MRKAdSnef , or (5) a mixture of equal amounts of 
MRKAd5gagpol and MRKAd5nef . The vaccines were administered at weeks 0 and 
4. 

The T cell responses against each of the HIV-1 antigens were assayed by IFN- 
gamma ElJSpot assay using pools of 20-aa peptides that encompass the entire protein 

30 sequence of each antigen. The results (Table 25) are expressed as the number of spot- 
forming cells (sfc) per million peripheral blood mononuclear cells (PBMC) that 
respond to each of the peptide pools. 

Results indicate the following observations: (1) each of the single gene 
constructs (MRKAdSgag, MRKAdSpol, or MRKAdSnef) is able to elicit high levels 

35 of antigen-specific T cells in monkeys; (2) the single-gene MRKAdS constructs can 
be mixed as a multi-cocktail formulation capable of eliciting very broad T cell 
responses against gag, pol, and nef ; (3) the MRKAd5 vector expressing the fusion 
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protein of gag plus lA pel is capable of inducing strong T cell responses to both gag 
and pol. 



Table 25. Evaluation of Mixtures of MRKAdS vectors expressing humanized 
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Indicatsed are rwnibers of spot-forming cells per million PBMCS against the peptide pools. Mock, no peptides; gag 
H, fifty 20-aa peptides wicompassing p55 sequence; poU. 20-aa peptides representing N-teiminal half of lA pol; 
pol-2, 20-aa peptides leiaesenting the carboxy-teiminal half of lA pol; nef, 20-aa peptides encompassing die entire 
wild-type nef sequence. Responses to the antigens prior to the first immunization did not exceed 40 sfc/lO'^ 
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WHAT IS CLAIMED IS 

1 . A rccombinant adenoviral vaccine vector at least partially deleted in 
5 El and devoid of El activity, comprising: 

a) an adenovirus cw-acting packaging region corresponding to from 
about base pair 1 to between from about base pair 400 to about 
base pair 458 of a wildtype adenovirus genome; and 

b) a gene encoding an HIV protein or immunologically relevant 
10 modification thereof. 

2. A vector in accordance with claim 1 comprising a packaging region 
corresponding to from about base pair 1 to about base pair 450 of a wildtype 
adenovirus genome. 

3. A vector in accordance with claim 1 further comprising nucleotides 
15 corresponding to between from about base pair 351 1 to about 3524 to about base pair 

5798 of a wildtype adenovirus genome, 

4. A vector in accordance with claim 3 comprising base pairs 
corresponding to 1-450 and 3511-5798 of a wildtype adenovirus genome. 

5. A vector in accordance with claim 4 which is deleted of base pairs 

20 451-3510. 

6. A vector in accordance with claim 1 which is at least partially 

deleted in E3. 

7. A vector in accordance wida claim 6 wherein the E3 deleted region 
is from base pairs 28,133-30,818. 
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8. A vector in accordance with claim 1 wherein the gene encoding the 
mV protein or modifiication thereof comprises codons optimized for expression in a 
human. 

9. A vector in accordance with claim 1 wherein the vector comprises a 
5 gene expression cassette comprising: 

a) a nucleic acid encoding a protein; 

b) a heterologous promoter operatively linked to the nucleic acid 

encoding the protein; and 

(c) a transcription teimination sequence. 

10 10. A vector in accordance with claim 9 wherein the gene expression 

« 

cassette is inserted hito the El region. 

11 ."An adenoviral vector in accordance with claM 9 wherein the gene 
expression cassette is in an El parallel orientation 

12. An adenoviral vector in accordance with claim 9 wherein the gene 
15 expression cassette is in an El antiparallel orientation. 

13. An adenoviral vector in accordance with claim 9 wherein the 
promoter is a cytomegalovirus promoter devoid of intronic sequences. 

14. An adenoviral vector in accordance with claim 13 wherein the 
promoter is an immediate early human cytomegalovirus promoter. 

20 15. An adenoviral vector in accordance with claim 9 wherein the 

promoter is a murine cytomegalovirus promoter. 

16. An adenoviral vector in accordance with claim 9 wherein the 
transcription termination sequence is a bovine growth hormone polyadenylation and 
transcription termination sequence. 
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17. An adenoviral vector in accordance with claim 9 wherein the 
transcription termination sequence is a synthetic polyadenylation signal (SPA), 

18. A cell comprising die adenoviral vector of claim 1. 

19. Recombinant, rq>lication-defective adenovirus particles harvested 
5 and purified subsequent to transfection of the adenoviral vector of claim 1 into a cell 

line which expresses adaioviras El protein at complementing levels. 

20. An HIV vaccine composition comprising purified adenovirus 

particles of claim 19. 

21. An HIV vaccine composition of claim 20 which comprises a 

10 physiologically acceptable carrier. 

22. A method of producing recombinant, replication defective 
adenovirus particles containing the adenoviral genome of the adenoviral vector of 
claim 1 which comprises introducing the adenoviral vector into a host cell which 
expresses adenoviral El protein, and harvesting the resultant recombinant, 

15 replication-defective adenovirus. 

23. A method according to claim 22 wherein the cell is a PER.C6 

ceU. 

24. A method of generating a cellular-mediated immune response 
against HEV in an individual comprising administering to the individual a vaccine of 

20 claim 21. 

25. A method according to claim 24 which further comprises 
administration to the individual a DNA plasmid vaccine, optionally administered with 
a biologically effective adjuvant, protein or other agent capable of increasing the 
immune response. 
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26. A method according to claim 25 wherein the DNA plasmid 
vaccme is administered to the individual prior to administration of an adenovirus 
vaccine. 

27. A method according to claim 24 wheredn the adenovirus vaccine is 
5 preceded by an adenovirus vaccine of a different sa:otype. 

28. A method according to claim 24 which comprises administering 
andreadministering the adenovirus vaccine vector to the individual. 

29. An adenoviral vector in accordance with claim 1 wherein the HUV 

protein is HIV gag or an immunologically relevant modification thereof. 

30. An adenoviral vector in accordance with claim 9 wherein the gene 

expression cassette comprises an open reading frame encoding an HTV gag protein or 
immunologically relevant modification thojeof . 

3 1. A recombinant adenoviral vaccine vector at least partially deleted 

in El and devoid of El activity, comprising: 
15 a) an adenovirus cw-acting packaging region corresponding to fiom 

about base pair 1 to about base pak 450 and from about 351 1 to about 
5798 of a wildtype adenovirus genome, and deleted for base pairs 
corresponding to ftom about base pair 451 to from about base pair 
3510 of a wildtype adenovirus genome; and 
20 b) a g«ie expression cassette comprising 

i) SEQIDNO:29; 

ii) a heterologous promoter operatively linked to i); and 

iii) a transcription tramination sequence. 
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32. An adenoviral vector in accordance with claim 31 wherein the 
gene expression cassette is in an El parallel orientation. 

33 An adenoviral vector in accordance with claina 31 wherein the gene 
expression cassette is in an El antiparallel orientation. 

34. An adenoviral vector in accordance with claim 31 wherein the 
promoter is a cytomegalovirus promoter devoid of intronic sequences. 

35. An adenoviral vector in accordance with claim 31 wherein the 
transcription tennination sequence is a bovine growth hormone polyadenylation and 
transcription termination sequence. 

36. An adenoviral vector in accordance with claim 31 which is at least 
partially deleted in E3. 

37. A cell comprising the adenoviral vector of claim 30. 

38. Recombinant) replication-defective adenovirus particles harvested 
and purified subsequent to transfection of the adenoviral vector of claim 30 into a cell 
line which expresses adenovirus El protein at complementing levels. 

39. An HIV vaccine composition comprising purified adenovirus 

particles of claim 38. 

40. An HIV vaccine composition of claim 39 which comprises a 
physiologically acceptable carrier. 

41. A method of producing recombinant, replication defective 
adenovirus particles containing the adenoviral genome of the adenoviral vector of 
claim 30 which comprises introducing the adenoviral vector into a host cell which 
expresses adenoviral El protein, and harvesting the resultant recombinant, 
replication-defective adenovirus. 
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42. A method according to claim 41 wherein the cell is a PER,C6® 



cell. 



43. A method of generating a cellular-mediated immune response 
against HIV in an individual comprising administering to the individual a vaccine of 

5 claim 21. 

44. A method according to claim 43 which further comprises 
administration to the individual a DNA plasmid vaccine, optionally administered with 
a biologically effective adjuvant, protein or other agent capable of increasing the 



tiiii 



lune response. 

10 45. A method according to claim 44 wherein the DNA plasmid 

vaccine is administered to the individual prior to administration of an adenovirus 



vaccme. 

46. A method according to claim 43 wherein the adenovirus vaccine is 
preceded by an adenovirus vaccine of a different serotype. 
15 47. A method according to claim 43 which comprises administering 

and readministering the adenovirus vaccine vector to the individual. 

48. An adenoviral vector in accordance with claim 1 wherein the HIV 
protem is HIV pol or an hnmunologically relevant modification thereof. 

49. An adenoviral vector in accordance with claim 9 wherein the gene 
20 expression cassette comprises an open reading frame encodmg an HIV pol protein or 

immunologically relevant modification thereof. 

50. A recombinant adenoviral vaccine vector at least partially deleted 

in El and devoid of El activity, comprising: 
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a) an adenovirus cw-acting packaging region corresponding to from 

* 

about base pair 1 to about base pair 450 and firom about 351 1 to about 
5798 of a wildtype adenovirus genome, and deleted for base pairs 
conesponding to from about base pair 451 to from about base pair 
3510 of a wildtype adenovirus genome; and 

b) a gene expression cassette comprising 

i) a nucleotide sequence selected the group consisting of 
SEQIDNO: l,SEQIDNO:5andSEQIDNO:7; 

ii) a heterologous promoter operatively linked to i); and 

iii) a transcription termination sequence. 

51. An adenoviral vector in accordance with claim 50 wherein the 
gene expression cassette is in an El parallel orientation. 

52. An adenoviral vector in accordance with claim 50 wherein the 
gene expression cassette is in an El antiparallel orientation. 

53. An adenoviral vector in accordance with claim SO wherein the 
promoter is a cytomegalovirus promoter devoid of intronic sequences. 

54. An adenoviral vector in accordance with claim 50 wherein the 
transcription termination sequence is a bovine growth hormone polyadenylation and 
transcription termination sequence. 

55. An adenoviral vector in accordance with claim 50 which is at least 
partially deleted in £3. 

56. A cell comprising the adenoviral vector of claim 49. 
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57. Recombinant, replication-defective adenovirus particles harvested 
and purified subsequent to transf ection of the adenoviral vector of claim 49 into a cell 
line which expresses adenovirus El protein at complementing levels. 

58. An HIV vaccine composition comprising purified adenovirus 

5 particles of claim 57. 

59. An HIV vaccine composition of claim 58 which comprises a 

physiologically acceptable carrier. 

60. A method of producing recombinant, replication defective 
adenovirus particles containing the adenoviral genome of the adenoviral vector of 

10 claim 49 which comprises introducing the adenoviral vector into a host cell which 
expresses adenoviral El protein, and harvesting the resultant recombinant, 



replication-defective adenovirus. 

61. A method according to claim 60 wherein the cell is a PER.C6 

i 

cell. 

15 62. A method of generating a cellular-mediated immune response 

against HIV in an individual comprising administering to the individual a vaccine of 
claim 59. 

63. A method according to claim 62 which further comprises 
administration to the individual a DNA plasmid vaccine, optionaUy administered with 

20 a biologically effective adjuvant, protem or other agent capable of increasing the 

inunune response. 

64. A method according to claim 63 wherein the DNA plasmid 
vaccine is administered to the individual prior to administration of an adenovirus 
vaccine. 
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65. A method according to claim 62 wherein the adenovirus vaccine is 
preceded by an adenovirus vaccine of a different serotype. 

66. A method according to claim 62 which comprises administering 
and readministering the adenovirus vaccine vector to the individual. 

67. An adenoviral vector in accordance with claim 1 wherein the HIV 
protein is HIV nef or an inununologically relevant modification thereof. 

68. An adenoviral vector in accordance with claim 9 wherein the gene 
expression cassette comprises an open reading framt encoding an HTV nef protein or 
immunologically relevant modification thereof. 

69. A recombinant adenovkal vaccine vector at least partially deleted 
in El and devoid of El activity, comprising: 

a) an adenovirus cis-acting packaging region corresponding to from 
about base pair 1 to about base pair 450 and from about 35 11 to about 
5798 of a wildtype adenovirus genome, and deleted for base pairs 
corresponding to from about base pair 451 to from about base pair 
3510 of a wildtype adenovirus genome; and 

b) a gene expression cassette comprising 

i) a nucleotide sequence selected the group consisting of 
SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13 and 
SEQ ID NO: 15; 

ii) a heterologous promoter operatively linked to i); and 

iii) a transcription termination sequence. 

70. An adenoviral vector in accordance with claim 69 wherein the 
gene expression cassette is in an El parallel orientation. 
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7 1 . An adenoviral vector in accordance with claim 69 wherein the 
gene expression cassette is in an El antiparallel orientation. 

72. An adenoviral vector in accordance with claim .69 wherein the 
promoter is a cytomegalovirus promoter devoid of intronic sequences. 

5 73. An adenoviral vector in accordance with claim 69 wherein the 

transcription termination sequence is a bovine growth hormone polyadenylation and 
transcription termination sequence. 

74. An adenoviral vector in accordance with claim 69 which is at least 

partially deleted in E3. 
10 75. A cell comprising the adenoviral vector of claim 68. 

76. Recombinant, repUcation-defective adenovirus pardcles harvested 



and purffied subsequent to transfection of the adenoviral vector of claim 68 into a cell 
Une which expresses adenovirus El protein at complementing levels. 

77. An HIV vaccine composition comprising purified adenovirus 

15 particles of claim 76. 

78. An HIV vaccine composition of claim 77 which comprises a 

physiologically acceptable carrier. 

79. A method of producing recombinant, rephcation defective 
adenovirus particles containing the adenoviral genome of the adenoviral vector of 

20 claim 68 which comprises introducing the adenoviral vector into a host cell which 
expresses adenoviral El protein, arid harvesting the resultant recombinant, 
replication-defective adenovirus. 

80. A method according to claim 79 wherein the cell is a PER.C6® 

cell. 
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81 . A method of gen9rating a cellular-mediated immune response 
against HIV in an individual comptising administering to the individual a vaccine of 
claim 78. 

82. A method according to claim 81 which further comprises 
administration to the individual a DNA plasmid vaccine, optionally administered with 
a biologically effective adjuvant, protein or other agent capable of increasing the 
immune response. 

83. A method according to claim 82 wherein the DNA plasmid 
vaccine is administered to the individual prior to administration of an adenovmis 
vaccine. 

84. A method according to claim 81 wherein the adenovirus vaccine is 
preceded by an adenovirus vaccine of a different serotype. 

85« A method according to claim 81 which comprises administering 
and readminist^ing the adenovirus vaccine vector to the individual. 

86. A multivalent adenovims vaccine composition comprising 
recombinant, replication-^fective adenovhnis particles, wherein the adenovirus 
particles are harvested and purified from a cell line expressing adenovirus El J)rotein, 
and wherein the particles are harvested subsequent to transfection of the cells with an 
adenoviral vector or vectors in accordance with claim 9; said vector(s) comprising a 
gene expression cassette or cassettes comprising nucleotide sequences encoding HIV 
proteins selected from the group consisting of: 

a) gag, pol, and nef , expressed independently from three individual 
vectors; 
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b) gag, pol, and nef, expressed independently ftom one vector with 
the encoding nucleic acid sequences operatively linked to distinct 
promoters and transcription termination sequences; 

c) gag, pol, and nef, expressed via two vectors, one expressing a pol- 
5 nef fusion, and another expressing gag; 

d) gag, pol, and nef, expressed via two vectors, one expressing a gag- 
pol fusion and another expressing nef; 

e) gag, pol and nef, expressed via two vectors, one expressing a nef- 
gag fusion and another expressing pol; 

10 f) gag, pol, and nef, expressed via one vector expressing a gag-pol- 

nef fusion; 



~ g) gag and pol, expressed indeperid^y from two individual vectors; 

h) gag and pol, expressed independently from one vector with the 
encoding nucleic acid sequences operatively linked to distinct 

15 promoters and transcription termination sequences; 

i) pol and nef, expressed independently from two individual vectors; 
j) pol and nef, expressed independently from one vector with the 

encoding nucleic acid sequences operatively linked to distinct 
promoters and transcription termination sequences; 
20 k) nef and gag, expressed independenfly from two individual vectors; 

1) nef and gag, expressed independently from one vector with the 
encoding nucleic acid sequences operatively linked to distinct 
promoters and transcription termination sequences; 

m) gag and pol, expressed via one vector expressing a gag-poi fusion; 



1 
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n) pol and nef , expressed via one vector expressing a pol-nef fusion; 
and 

o) nef and gag, expressed via one vector expressing a nef-gag fusion. 

87. A multivalent adenovirus vaccine composition in accordance with 
5 claim 86 wherein the gag-pol fusion consists of SEQ ID NO: 39. 

88. A multivalent adenovirus vaccine composition in accordance with 
claim 86 wherein the fused sequences have the encoding nucleic acid sequences 
operatively linked to distinct promoters and transcription termination sequences. 

89. A multivalent adenovirus vaccine composition in accordance with 
10 claim 86 wherein the fused sequences have the encoding nucleic add sequences 

operatively linked to a single promoter; and the encoding nucleic acid sequences 
operatively linked by an internal ribosome entry sequence ("IRES")- 
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Original Adenovector Construct: 



06- 




£3* 



El- 
Figure 1: Original HIV-1 gag adenDvector. 
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Sequence of the open reading frame for F L^obq fhuman codon optimized) 

algggtgctagggcttctgtgctglctggtggtgagctggacaagtgggagaagatcaggctgaggcctggtgg 

caagaagaagtacaagctaaagcacattgtglgggcctccagggagctggagaggtttgolgtgadooctggc 

dgctggagacctcrtgagggglgcaggcdgatcctgggcxagctc^goc^tc^ctgc^aacaggct^^ 

agctgaggtaDcfglaoaacacagtggctaccctgtactgtgtgcaccagaagattgatgtgaaggac3ccaag 

gaggccxlggagaagattgaggaggagoagaacaagtccaagaagaaggcccagcaggotgctgclggc 

acaggc^aclocagccaggtgtcx^gadctaccccattglgcagaacdccagggccagatggtg^ 

gccatctac(xx:ggaixdgaatgcctggg1gaagg1ggtggaggagaaggocttctc^ 

catgtlctrtgccdgtctgagggtgoiaccccccaggacclgaaMCM^ 
aggclgccatgcagatgctgaaggagaccatcaatgaggaggctgctgagtgggacaggctgcatootgtgc 

acgctggcrecattgcc«jcgg«:agatgagggagccMggggctctgacangclgg 

cx^aggagcagattggctggntgaccaacaaccccoc^tc^c^gtgggggaaatctacaagaggtggatc^^ 

cctgggcctgaacaagatlgtgaggatgtactcccccacctccatcctggaratcaggcagggccc^ 

ccrttcagggactatgtggacaggnctecaagawjdgagggctgagcaggcctccraggagglgaagaart 

ggaigacagagaccxjtgctggtgcagaalgcxaaccctgadgcaagaccatcctgaaggccctgggccctg 

cigccaccctggaggagalgatgacagcclgccagggggtggggggccctggtcacaaggccagggtgclg 

gctgaggccatgtcccaggtgaccaactccgccaccatcalgalgcagaggggcaactlcaggaaccagag 

gaagacaglgaagtgcticaadgtggcaaggtgggcoacattgccaagaactgtagggcccccaggaaga 

agggctgclggaaglglggcaaggagggccaccagatgaaggactgcaalgagaggcaggccaacttcrtg 

ggcaaaatctggccctcccacaagggcaggcctggcaacttcctccagtccaggcctgagcccacagcccct 

CKjgaggaglccttcaggtttggggaggagaagaccacccccagccagaagcaggagcccatlgacaagg 

agclgiac(xcclggcctc«rtgagglccctgtttggcaacgaccwtcc!cccagtaaaataaagoccgggca 

gat (SEQIDNO:29} 
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Old Transgene: 



hCMV 



PA 



NewTransgenes: 



hCMV 



GAG 



BGH 



GAG 




mCMV 



Figure 3: Diagrammatic representation of the original HIV-1 gag transgene and the series of 
new transgene constructions. 
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Figure 4: Modif ications made to the current adenoveclor backbone in the generation of the new 
vector. 
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(packaging exi.) 



EI E3- 



£xpt«a £3 £3 

(E3 vsE3*) 



El- £3- 
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I 



Fioure 5- virus mixino experiments to determii^e the ef eels of the addition made to the packaging sjgrjal 
region (Expt #1 ) and ena»>'5ts of tha effacis ol the E3 gana on viral growth (Expl. #2). The red bars denote 
the region of mod'rfioalions made to the El deletion. 
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Figure 6: AuloracJiograpr* o1 viral DNA anafysb following viral mixing experiments (ezpts. #1 
and 412) as detailed in the text. 
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UK ^ y SAG »»c»/v "^1 
I iMS^^^r ^y^t ^fim : gj. 1 
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Fiaure 7A- hCMV.FLsa9.bGHpA aitenoveclors construertwJ w«hin the •MRK" backbone. Ei 
^aSli^EiZp^^Utr^Sfir^ orientation wtthlnth, £3- and E3. backbones were 
construciBd. 



7/144 



wo 02/22080 



PCT/USOl/28861 



i 1 il' "11 'I 'm I 



£3* 



«r 



cm SAG 



tCM.r 



E3* 



»*^*^ GAS K** 



E3: 



1^ T 



»CMV' SAG fiM 



E3* 



FiQure 7B- hCMV-FLgaQ-SP A adenoveciors consirucled within the ^MRK" backbone. E1 
pa^llel and El entiparallel uan^anc orienlalion wtthin the E3- and E3+ backbones were 
constructed. 
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Figure 7C: mCMV-FLgag-bGHpA actenoveciors constfucled within the*MRlC backbone. El 
parallel and E1 antiparallel transgene orientation within the E3- and E3+ backbones were 
constnjcted. 
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Plasmid mixing expt: (orientation) 




Figure 8A: Effect oi transgene orientation 
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Plasmid Mixing expt: (poiy a signal) 




E3' 



Figure &B: Effect of polyadenylation signal 
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Figure 9: Viral DNAfrom the four Adgag candidates at P5, folbwing BsfEII digestion. 
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Figure 10: Viral DMA analysis of passage 11 and^or 12 of MRKHVE3. MRKAdSgag and 
MRKAd5gag(E3-}. 
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Figure 11: Viral DMA analysis (H/ndlll digestion) of passage 6 MIRKHVE3 and M.^KAdSgag used 
to initiate the viral conripetition study. Last two lanes are passage 11 analysis of dupiicale 
passages ot the competition study (each virus at MOI 280 vp). 
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Ftaurc 12- Vira! DNA anafvsis by Hrndlll digestion on high passage numbers lor MRK^S^S 

cor^^^^^ wim coBections made at specKled times. The 1^^';^^^;?^^^^^ 
DNAsSriw^^ The other lanes represent pre-plasmid control (digested wrth Pad and Hindlll). 
?nd MrSaX^^^ panged to P16. P1B and P2nserum containing media). 
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0 

Figure . Smm ami-p24 Levels al 3 ^lis post i.jn. imniunizaiion of balb'c mice (n«10) 
wiih N'arv-ing Doses of Several Adgag constructs: (A) MKJCAdSgag (through passage 5): 
(B) hmUd5 £3* bCMV.FLgag-bGHpA; (C) MRK.Ad5 E3' bCMV-FLgag-SPA; (D) 
MRXAd5 E3* mCMV-FLgag-bGHpA; (D) research Loi (293 cell-derived) of AdSHIV- 
]gag; and (F) clinical loi (Ad5gagFKOO0)) of AdSHIV-lgag. Reponed are the geometric 
mean liiers (GMT) for each cohor\. 
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MeKlleSerProlleGluThrVolPro^ 
^ I 10 20 

GCAGTGGCCCCTGACTGAGGACAACATCMIGCCaGCTGGAAATCTC^^^ 
ilnTrpProLeuThrGliXJlMSllelysAloLeuVolGluIleCysThrCluMelGluLysGluGlyLysIl^ 

30 

AGATlGGCCCOSAGAACCCCTACAACACtCCTGTGTnGCCATCAAGAAGAAGGACT 
yslleClS^luAsnProTyrAsnThrProVolPheAlolleLysLysLysAsp&^^^^ 



60 



80 5^ 

CAAGAAGTCTCTGACIGTGCIGCCTGTGGGGGATGCCIACTICTCTGTGCCCCTGGATG^^^^ 

CCTTCACCAICCCCICCATCAACAATGAGACCCCTGGCATCAGGTACCA^^^ 
I oPheTnr ) I cPr oScr 1 1 eA^nAsnG 1 ulhrProU y ! leArgTyr^ 

140 

TCOXTGCCATCTTCCACTCCTCCATGACCAAGATCCTrcAGCCCTTCAGGAAGCASA^^^ 
sSollePhSTnSerSerMelThrLyslleLeLCIuProPheAr^^^^ 

150 "0 



CTACATGGCTGCCCIGTATGTGGGCICTGACCTGWGAITGGGCAGCAC^^^^ 
SelAUloleuTyrVolGlySerAspLeuGlulleGlyGlnl^^^^ 
190 200 



TGClGADGTGGGGCClGACCACCrciGACAMJAAGCACCttA^a 
cuLeuAroTrpGlyLeuThrlhrProAsplysLysHlsClnLysGluProProPheLeulrpMelGlyTyrt 

220 

240 250 /t)U 

CAAGCTGAACTGGGCCTCCCAAAlCTACaTGGCAlW^TGA^ 
yLysLeuAsnTrpAloSerGlnlleTyrProGlylleLysVolArgGlnleuCysLysLeoLeuArgPlyT^ 



HGURE 17A 
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7GACTGAGGTGATCCCCC1GACTGAGGAGGCTGAGCTGGAGCTGGCTGAGAACAGGCADATCCTGAAGGACCCTGTGCAT 
EuThrCluVolllePfoLeuThrGluGluAloGluleuGluLeuAloGluAsnArgGlulleLeuLysGluProVolHis 

300 310 

HXXlTGTACTAlCACCCCTCCAAGCACCTGATlCCTCACAlCCAGAAGCAGGGCCAfiGG^ 
GlyVolTyrTyrAspProSerlysAspLeul leAloGIuI leGlnLysGlfiGlyGlnGlyGInTrpThrTjrrGlnl leTy 

320 330 340 

CCAGGAGCCCTTCAAGAACCTGAABACTCGCM;TATG(XyC6ATGADGGGGGa^^ 
rGlnGluProPhcLysAsnLcuLysThrGlyLysTyrAloArgMelArgGlyAloHisThrAsnAspVolLysGlnLeuT 

350 360 370 

Ci:-AGGCTGTGCAGAAGATCACCAC7CACTCUTTGTGA1CTGGGGCAAGACCCCCAAGTTCAAD^ 
hrGluAloVolGlnLyslleThrThrGluSerllcVollleTrpGlyLysThrProlysPheLysLeuProlleGlnLys 

380 390 

GAGACCTGGGAGACCTGGTGGACTGAGUCTGGCAQGCCACCTGGATCCCIGAGTGGGAGTTTGTGAACACCCCCCCCCT 
GluThrlrpCluThrTrpTrpThrGluTyrTrpGlnAloThrTrpIleProGluTrpGluPhcVolAsnThrProProLe 
400 410 420 

GGlCAAGCTGlGGTACCAGCTGGAGAAGGAGCCCATlGlGGGGGCTGAGACCnCTATGTGGCTGGGGCTGCCAACAGGG 
uVolLvsLeulrpTyrGlnLeuGluLysGluProlleVolClyAloCluThrPheTyrVolAloGlyAloAloAsnArgG 

430 440 ~~ 450 

AGACCAAGCTGGGCAAGGCICGCTAlGTGACCAACAGGGCCAGGCAGAACGiGGTGACCCTGACTGACACCACCAACC^ 
I ulhr L ysLeuG I yLysA I oG I yTyrVo I Thr AsnArgC I yAr gG I nLysVo I Vo I ThrLeuThr AspThrThr AsnG I n 

4S0 470 

AAGACTGCCCTCCA0GCCATCTACCTGGaX:TCCADGACTCTG6aTCGACCTGAACA77CTCACT6CCTCCCAGTATGC 
LysThrAloLeuGlfiAIoIleTyrLeuAloLeuGlnAspSerGlyLeiXJIuVolAsnlleVolThrAloSerClnTyrAI 

480 490 WO 

CCICGGCATCATCCAGGCCCAGCCIGATCACTCTGACTCTGAGCTGGTCAACCAGATCATTGAGCAGCTGATCA^ 
oLeuGlyllelleGlnAloGlnProAspGlnSerGluSerGluLeuVolAsnGlnlielieGliCinLeuIlelysLysG 

510 520 530 

/SCAACGTGTACCTCGCCTGGGTGCCTGCOCADWDODCATTGODGGCA^^^ 

luUsVotTyrLeuAloTrpVoiProAloHisLysGlylieGly&lyAsnGluGlnVolAspLysLeuVotSerAloCly 

540 550 

AlCAGGAACGTrcTGTTCCTGGATCCCATTCACAAGCCCCAGGATGAGCATGAGAAGlACCACT^ 
1 1 eAr gL y sVo I L euPheLeuAspG 1 y 1 1 eAspLys A I oG I nAspG I uH i sG I uLysTyrH I sSer AsnTrpAr gA I oMe 
5B0 570 580 
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GGCCTCTGACTlCAACCTGCCCCCTGTI^TGGCTAAKAGATTGTGGCCTtim.^^ 
tAloScrAspPheAsnLeuPrrf>roVoIVolAloLysCiuIleVolA!oSerCysAspLy 

590 

CCATGCATGGGCAGGTGGACTGC7C(XCTGGCATC7GGCAGCTGGCCTGC^^ 
|(«elHisGlyGlnVoIAspCysSerProGlylleTrpGlnLeuAlpCysThrH.sLci^^ 

520 63£) 

GCTGTGCATGTGGCCTKGGCTACATTGAGGCTGAGGTGATCCCTGCTCAGACAH^ 
AloVomirVo^iloSerGlyTyrlleGluAlaGloVolIleProAloGluT^ 

640 6W 



GAAGC1GGCTGGCAGGTGCCCTGTGAAGACCA7CCACA:TGCCAATGGCTC^^^ 
uLySfiSlflGlyArgTrpProVoiLysThrlleHisThrAloAsrClj^^ 

670 ^ 

CCTGCTGGTGGGCTCGCATCAAGCAGGAGlTlGGCAlCCCCTA(>ACCajAGJ^ 
l^iysTrpTrpAloGlylleLysGlnGluPheGlylleProTyrAsnProGlnSerG Jly^ 

' 700 ™ 

AAGG>SCTGAAGAAGATCATTG0GCAGGTGAGG6ACCACGCTGAGCACCTGA^^^ 
uSluLeuLysLysllelleGlyGlnVolArgAspGlnAloGluHisLeuLysTnrAloVolGlnl^W 

_ 720 — - ' " 



CCACAACTTCAAGAGGAAGGGGGGCATCGGGGGCTACTCCCCTO^ 

eHisAsr«.eLysAr9LysGlyCiylleGlyGlyTyrSerAlaGlyGuArg!lcVolAspIleIleAloThrte^^ 

750 

AGACCAAGGAGCTCCAGAAGCAGATCACCAAGATCCAGAACTTCADGGTGUCTAW^ 



AAGGGCCCTGCCAABCTQCIGTGGAAGGGGGACGCGGCTGTGGTCATCCAGGACW^ 
fSoAfolysledeuTrpLysGl^ 

BOO 810 

J30 



AAADCCCGGCCAGATCL i^R^S?:!^ 
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wr 

OPT 

WT 
OPT 

WT 
OPT 

WT 

OPT 

WT 
OPT 

WT 

OPT 

WT 
OPT 

WT 
OPT 

WT 
OPT 

WT 
OPT 



ATG GGT QGC AAG TGG TCA AM CGT AGT GT6 CCT GGA TGG TCT -42 



I I II III III III II II I HI il 11 
ATG GGC GGC AAG TGG TCC AAG AGG TCC GTG CCC GGC TGG 
H66KWSKRSVPGW 



I 

CC 
S 



ACT GTA AQG" GAA AGA ATG AGA C6A GCT GAG CCA GCA GCA GAT 
I 



II II III II I II III II II II II 

ACC GTG AGG GAG AGG ATG AGG AGG GCC GAG CCC GCC GCC GAC 
T VRERHRRAEPAAD 

AGG ETTG AGA CGA ACT GAG CCA GCA GCA GTA QGG GTG GGA GCA 

III III il I II III II II II II II III II II 
AGG GT6 AGG AGG ACC GAG CCC GCC GCC GTG GGC GTG GGC GCC 

RVRRTEPAAVeVGA 

GTA TCT C6A GAC CTG GAA AAA CAT GGA GCA ATC ACA AGT AGC 

GTG TCC AGG GAC CTG GAG AAG CAC GGC GCC ATC ACC TCC TCC 
VSRDLEK HGAITSS 



AAT ACA GCA GCT ACC AAT GCT GAT TGT GCC TGG CTA GAA GCA 

II II II II Mi il II II II III III II II II 

AAC ACC GCC GCC ACC AAC GCC GAC TGC GCC TGG CTG GAG GCC 
NTAATNADCAWLEA 



,, ,,, 1 ,, 

CAG GAG GAC GAG GAG GTG GGC TTC CCC GTG AGG CCC CAG GTG 
QEDEEVGFPVRPQV 

CCT TTA AGA CCA ATG Aa TAC AAG GGA GCT GTA GAT CTT AGC 



II II I I 



II 11 il I 



CCC CTG AGG CCC ATG ACC TAC AAG GGC GCC GTG GAC CTG TCC 
PLRPHTTKGAVDLS 

CAC TH TTA AAA GAA AAG GGG GGA CTG GAA GGG CTA ATT CAC 

III II I 11 II III il II I I ii ii il II III 
CAC TTC CTG AAG GAG AAG GGC GGC CTG GAG GGC CTG ATC CAC 
HFLKEKG6LE6LIH 

TCA CAG AAA AGA CAA GAT ATC CTT GAT CTG TGB GTC TAC CAC 

II ill II II II II i I ii Ii i I ili I Mi III 
TCC CAG AAG AGG CAG GAC ATC CTG GAC CTG TSB GTG TAC CAC 
SQKRQDILDLWV Y H 

ACA CAA GGC TAC TTC CCT GAT TGG CAG AAC TAC ACA CCA GGG 

iic ii. Ill lii Hi iic iic lii ^ilUii lii lie iic iic 

TQGYFPDWONYTPG 



•14 
-84 

•28 
-126 

-42 
-168 

•66 
-210 

•70 



CAA GAG GAT GAG GAA GTG GGT TTT CCA GTC AGA CCT CAG GTA -252 



•84 
•294 

•98 

•336 

•112 
-378 

•126 
•420 

•140 
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wr 

OPT 

WT 
OPT 

WT 

W>T 

VfT 
OPT 

vnr 

DPT 

WT 

OPT 



CCA 6GA ATC AGA TTT CCA TTG ACC TTT BOA TGG TQC TTC AAG 

II II I I II II II II III II II 
CCC BGC ATC AGG TTC CCC CTB ACC TTC GGC 
P6IRFPLTFG 



GG TGC TTC AAG 
W C F K 



CTA GTA CCA GH BAG CCA GAA AAG GTA 6AA GAG GCC AAT 6AA 

II II II II III II II III 11 II ill ill il II 
CTG STG CCC GTG GAG CCC GAG AAG 6TG BAG GAG GCC AAC GAG 
LVP VEPEKVEEANE 

GGA GAG AAC AAC TGC TTG TTA CAC CCT ATG AGO CAG CAT GGG 

II III III III III II I III II III I III II II 
GGC GAG AAC AAC TGC CTG CTG CAC CCC ATG TCC CAG CAC GGC 
6ENNCLLHPMSQHB 



,,, II III I I III III III I 

ATC GAG GAC CCC GAG AAG GAG GTG CTG BAG TGG AGG TTC GAC 
lEDPEKEVLEWRFD 



II II ill III III I 



II II 



TCC AAG CTG GCC TTC CAC CAC GTG GCC AGG GAG CTG CAC CCC 
SKLAFHHVARELHP 



GAG TAC TAC AAG GAC TGC T6A 

III III III III ill Hi i I 
GAG TAC TAC AAG GAC TGC TAA 

E Y Y K D C 



462 



154 



504 



168 



546 



1G2 



ATA GAG GAC CCG GAG AAG GAA GTG TTA BAG TGG AGG TFT GAC -588 



-196 



ABC AAG CTA GCA TU CAT CAC GfTG 6CC C6A GAG CTG CAT CCG -630 



(SEQ ID N0:30) 

(contained within seqidkO:9) 

(SEQfi>NO:10) 



•210 



•651 



216 
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HIV'Pohinact (opt) 



WRKpds 




KanR 



Bgll1/phpsph8tase 



Egll ^(iSDlate pol gene fragment) 



T 



Ligation 

Screen for correct orientation of pol 




HIV'Pd'inact (cpt) 




Ctal digest pre-plasmid 



ta 1109114139. 

Digest With Pad and Bst1107 ( (or Bst2107 1) 



Bacteria! Homologous Recombination (BJ51 B3) 
XL-1 blue (plasmid amplification) 




FIGURE 22 
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Bgi 1 1/phosphatese Bgll 1(isolale nef gene fragment) 




as 11971 14139J 

Clal digest pre-p)asmid Digest with Pad and Bstl107 I (or Bsl2107 )) 



Bacterial Homobgous Recombination (BJ5183) 
XL*1 blue (ptasmid amplification) 



Figure 2: Construction of pre-plasmid "pMRKAdSner. 
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MRKAdSpol MER1062 
(MRKAdS Pre-Adenoviral Vector Containing the lA opt pol Cooing Kegon) 



1 CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TGATAATGAG 
GTAGTAGTTA TTATATGGAA TAAAACCTAA CTTCGGTTAT ACTATTACTC 

51 GGGGTGGAGT TTGTGACGTG GCGCGGGGCG TGGGAACGGG GCGGGTGACG 
CCCCACCTCA AACACTGCAC CGCGCCCCGC ACCCTTGCCC CGCCCACTGC 

101 TAGTAGTGTG GCGGAAGTGT GATGTTGCAA GTGTGGCGGA ACACATGTAA 
XTCATCACAC CGCCTTCACA CTACAACGTT CACACCGCCT TGTGTACATT 

151 GCGACGGATG TGGCAAAAGT GACGTTTTTG GTGTGCGCCG GTGTACACAG 
CGCTGCCTAC ACCGTTTTCA CTGCAAAAAC CACACGCGGC cacatgtgtc 

201 6AAGTGACAA TTTTCGCGCG GTTTTAGGCG GATGTTGTAG TAAATTTG6G 

cttcactgtt aaaagcgcgc caaaatccgc ctacaacatc atttaaaccc 

251 cgtaaccgag taagatttgg ccattttcgc gggaaaactg aataagagga 
gcattggctc attctaaacc ggtaaaagcg cccttttgac ttattctcct 

301 agtgaaatct gaataatttt gtgttactca tagcgcgtaa tatttgtcta 
tcactttaga cttattaaaa cacaatgagt atcgcgcatt ataaacagat 

351 gggccgcggg gactttgacc 6tttac6tgg agactcgccc aggtgttttt 

CCCGGCGCCC CTGAAACTGG CAAAO'GCACC TCTGAGCGGG TCCACAAAAA 

401 CTCAGGTGTT TTCCGCGTTC CGGGTCAAAG TTG6CGTTTT ATTATTATAG 
GA6TCCACAA AAGGCGCAAG 6CCCAGTTTC AACCGCAAAA TAATAATATC 



451 GCGGCCGC6A TCCATTGCAT ACGTTGTATC CATATCATAA TATGTACATT 
CGCCGGC6CT AGGTAACGTA TGCAACATAG GTATAGTATT ATACATGTAA 

501 TATATTGGCT CATG7CCAAC ATTACCGCCA TGTTGACATT GATTATTGAC 
ATATAACC6A GTACAGGTT6 TAATGGCGGT ACAACTGTAA CTAATAACTG 

551 TAGTTATTAA CAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA 
ATCAATAATT ATCATTAGTT AATGCCCCAG TAATCAAGTA TCGGGTATAT 

601 T06AGTTCCG CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG 
ACCTCAAGGC CCAATGTATT GAATGCCATT TACCGGGCGG ACCGACTGGC 

651 CCCAACGACC CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT 
GGGTTGCTGG GGGCGGGTAA CTGCA6TTAT TACTGCATAC AAGGGTATCA 

701 AACGCCAATA GGGACTTTCC ATTGACGTCA ATGGGTGGAG TATTTACGGT 
TTGCGGTTAT CCCTGAAAG6 TAACTGCAGT TACCCACCTC ATAAATGCX:A 

751 AAACTGCCCA CTTGGCAGTA CATCAAGTGT ATCATATGCC AAGTACGCCC 
TTTGACGGGT GAACCGTCAT GTAGTTCACA TAGTATACGG TTCATGCGGG 

801 CCTATTGACG TCAATGACGG TAAATGGCCC GCCTGGCATT atgcccagta 

ggataactgc agttactgcc atttaccggg cggaccgtaa tacgogtcat 
851 catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 

GTACTGGAAT ACCX:TGAAAG GATGAACXTGT CAT6TAGAT6 CATAATCAGT 
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901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 



TCGCTATTAC diPbCTGATG CGCTTTTOGC AGTACATCAA TGGGCC||pA 
AGCGATAAT6 GTACCACTAC GCCAAAACCG TCATGTAGTT ACCCGCACCT 

TAGCGGTTTG ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA 
ATCGCCAAAC TGAGTGCCCC TAAACGTICA GAGGTGGGGT AACTGCAGTT 

TGGGAGTTTG 7TTTGGCACC AAAATCAACG GGACTTTCCA AAATGTCGTA 
ACCCTCAAAC AAAACCGTGG TmAGTTGC CCTGAAAGGT TTTACAGCAT 

ACAACTCC(X CCCATTGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG 
TGTTGAGGCG GGGTAACTGC GTTTACCrCC CATCC6CACA TGCCACCCTC 

GTCTATATAA GCAGAGCTC6 TTTAGTGAAC CGTCAGATCG CCTGGAGACG 
CAGATATATT CGTCTCGAGC AAATCACTTG GCAGTCTA6C QGACCTCTGC 

CCATCCACGC TGTTTTGACC TCCATAGAAG ACACCQGGAC CGATCCAGCC 
GGTAGGTGCX3 ACAAAACTGG AGGTATCTTC TGTG6CCCT6 GCTAGGTCGG 

TCCGCGGCCG GGAACGGTGC ATTG6AAC6C QGATTCCCC6 TGCCAAGAGT 
AGGCGCCGGC CCTT6CCACG TAACCTTGCG CCTAA6GG6C ACGGTTCTCA 

GAGATCTACC ATCGCCCCCA TCTCCCCCAT TGAGACTGTG CCTGTGAAGC 
CTCTAGArGG TACCGG6GGT AGAGG5GGTA ACTCTGACAC CGACACTTCG 

TGAAGCCTGG CATGGAT6GC CCCAASCTCA AGCAGTGGCC CCT6ACTGAG 
ACTTCGGACC GTACCTACCG GGGTTCCACT TCGTCACCGG G6ACTGACTC • 

6AGAAGATCA AGGCCCT6GT GGAAATCTGC ACTGAGATGG AGAAGGAGGG 
CTCTTCTAGT TCCGGGACCA CCTTT7VGACG TGACTCTACC TCTTCCTCCC 

CAAAATCTCC AAGATTG6CC CCGAGAACCC CTACAACACC CCTGTGTTTG 
GTTTTAGAGG TTCTAACCGG GGCTCTTGGG GATGTTGTGG 6GACACAAAC 

CCATCAAGAA GAAGGACTCC ACCAAGTGGA GGAAGCTGGT GGACTTCAGG 
GGTAGTTCTT CTTCCTGAGG TGGTTCACCT CCTTCGACCA CCTGAAGTCC 

6A6CTGAACA AGAGGACCCA G6ACTTCTGG GA6GTGCAGC TGGGCATCCC 
CTCGACTTQT TCTCCTGGGT CCTGAA6ACC CTCCACGTCG ACCCGTAGGG 

CCACCCCGCT CGCCTGAACA AGAAGAAGTC TGTGACTGTG CTGGCTGTGG 
GGTGGGGCGA CCGGACTTCT TCTTCTTCAG ACACTGACAC CACCGACACC 

GG6ATGCCTA CTTCTCTGTG CCCCTGGATG AGGACTTCAG 6AAGTACACT 
CCCTACG6AT GAAGAGACAC GGGGACCTAC TCCTGAAGTC CTTCATGTGA 

6CCTTCACCA TCCCCTCCAT CAACAATGAG ACCCCTGGCA TCAGGTACCA 
CGGAAGTC6T AGGGGAGGTA GTTGTTACTC TGGGGACCGT AGTCCATGGT 

GTACAATGTG CTGCCCCAGG GCTGGAAGGG CTCCCCTGCC ATCTTCCAGT 
CATCTTACAC GACGGGGTCC CGACCTTCCC GAGGGGACGG TAGAAGGTCA 

CCTCCATGAC CAAGATCCTG GAGCCCTTCA GGAAGCAGAA CCCTGACATT 
GGAGGTACTG GTTCTAGGAC CTCGGGAftGT CCTTCGTCTT GGGACTGTAA 

GTGATCTACC AGTACATGGC TGCCCTGTAT GTGGGCTCTG ACCTGGAGAT 
CACTAGATGG TCATGTACCG AC6GGACATA CACCCGAGAC TGGACCTCTA 
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1851 TGOGCAGCAC ^.HCCAAGA TTGAGGAGCT GAGGCAGCAC CTGCTG«T 
ACCCGTCGT6 TCCTGGTTCT AACTCCTCGA CTCCGTCGTG GACQACTCCA 

1901 GGGCCCTGAC CACCCCTGAC AAGAAGCACC A6AAGGAGCC CCCCTTCCTG 
CCCCGGACTG GTGGGGACTC TTCTTCGTGG TCTTCCTCGG OGGGAAjSGAC 

1951 TGGATGGGCT ATGAGCTGCA CCCCGACAAG TGGACTGTGC AGCCCATTGT 
ACCTACCCGA TACTCGACCT GCC6CTCTTC ACCTGACACG TCGGGTAACA 

2001 GCTGCCTGA6 AAGGACTCCT GGACTGTGAA TGACATCCAG AAGCTGGTGG 
CGACGGACTC TTCCTGAGGA CCTGACACTT ACTGTAGGTC TTCQACCACC 

2051 GCAA6CTGAA CTCGGCCTCC CAAATCTACC CTGGCATCAA 6GTGAGGCAG 
CGTTCGACTT GACCCGGAGG GTTTA6ATGG 6ACCGTAGTT CCACTCCGTC 

2101 CTGTCCAACC TCCTGAGGGC CACCAAGGCC CTGACTGAGG TGATCCCCCT 
GACACGTTCG ACQACTCCCC OTGGTTCCGG 6ACTGACTCC ACTAGGGGGA 

2151 GACTGAGGAG GCTGAGCTGG AGCTGGCTGA GAACA6G6AG ATCCTGAAGG 
CTGACTCCTC CGACTCGACC TCGACCGACT CTTGTCCCTC TAGGACTTCC 

2201 AGCCTGTGCA TGGGGTGTAC TATGACCCCT CCAAGGACCT GATTGCTGAG 
TCG6ACACGT ACCCCACATG ATACTGGGGA GGTTCCTGGA CTAACGACTC 

2251 ATCCAGAAGC AGGGCCAGG6 CCAGTGGACC TACCAAATCT ACCAGGAGCC 
TAGGTCTTCG TCCCGGTCCC GGTCACCTGG ATGGTTTAGA TGGTCCTCGG 

2301 CTTCAAGAAC CTG.V^GACTG GCAAGTATGC CAGGATGAGG GGGGCCCACA 
GAAGTTCTTG GACTTCTGAC CGTTCATACG GTCCTACTCC CCCCGGGTGT 

2351 CCAATGATGT GAAGCAGCTG ACTSAGGCTG_TGCAGAAGAT_CACCACTGAG 
GGTTACTACA^'ST^^^^ CTGCTGACTC 

2401 TCCATTGTGA TCTGGGGCAA GACCCCCAAG TTCAAGCTGC CCATCCAGAA 
AGGTAACACT AGACCCCGTT CTGGGG6TTC AAGTTCGACG GGTAGGTCTT 

2451 GGAGACCTGG GAGACCTGGT GGACTGAGTA CTG6CAGGCC ACCTGGATCC 
CCTCTGGACC CTCTGGACCA CCT6ACTCAT GACCGTCCGG TGGACCTAGG 

2501 CTGAGTGGGA GTTTGTGAAC ACCCCCCCCC TOGTGAAGCT CTGGTACCAG 
GACTCACCCT CAAACACTTG TGGGGGGGGG ACCACTTC6A CACCATGGTC 

2551 CTGGAGAA6G AGCCCATTGT GGGGGCTGAG ACCTTCTATG TGGCTGGGGC 
GACCTCTTCC TCGGGTAACA CCCCCGACTC TGCAAGATAC ACCGACCCCG 

2601 TGCCAACAGG GAGACCAAQC TGG0CAAGOC TGGCTATGT6 AOCAACAGGG 
ACGGTTGTCC CTCTG6TTCG ACCCGTTCCG ACCGATACAC TQGTTGTCCC 

2651 GCAGGCAGAA GGTGGTGACC CTGftCTGACA CCACCAACCA GftAGACTGCC 
CGTCCGTCTT CCACCACTGG GACTGACTGT GGTGGTTQGT CTTCTGACGG 

2701 CTCCAOGCCA TCTACCTGGC CCTCCAGGAC TCTGGCCTGG AGGTGAACAT 
GAGOTCCGGT jWSATGGACCG GGAGGTCCTG AGACCGGACC TCCACTT6TA 

2'751 TGTGACTGCC TCCCAGTATG CCCTGGGCAT CATCCAGGCC CAGCCTGATC 
ACACTGACQQ AGGGTCATAC GG6ACCCGTA GTAGGTCCGG GTCGGACTAG 
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28D1 AGTCTGAGTC l^fcCTGGTG AACCAG&TCA TTGAGC»GCT CATCAA^G 
TCAGACTCAG AOTGACCAC TT6GTCTAGT AACTCGTCGA CTAGTtWc 

2851 GAGAA6GTGT ACCT6GCCTG GGTGCCTGCC CACAAGGGCA TTGGGGGCAA 
CTCTTCCACA TGGACCGGAC CCACGGftCGG GTGTTCCCGT AACCCCCGTT 

2901 TGAGCAOGTG 6ACAAGCTGG TGTCTOCTG6 CATCAQGAAG GT6CTGTTCC 
ACTC6TCCAC CTGTTCGACC ACAGACGACC GTAGTCCTTC CACGACAAGG 

2951 TGGATOGCAT TGACAAGQCC CAGGATGAGC ATGAGAAGTA CCACTCCAAC 
ACCTACCGTA ACTGTTCCGG GTCCTACTCG TACTCTTCAT CGTGAGGTTG 

3001 TGGAGOGCTA TOGCCTCTGA CTTCAACCTG CCCCCTGTGG TGGCTAAG6A 
ACCTCCCGAT ACCGGAGACT GAAGTTGGAC GGGGGACACC ACCGATTCCT 

3051 6ATTGTGGCC TCCTGTGACA AGTGCCAGCT GAAGGGGGAG GCCATGCATG 
CTAACACCGG AGGACACTGT TCAC6GTCGA CTTCCCCCTC CCGTACGTAC 

3101 GGCAGGTGGA CTGCTCCCCT GGCATCTGGC AGCTGGCCTG CACCCACCTG 
CCGTCCACCT SACGAGGGGA CCGTAGACCG TCGACCGGAC GTGGGTGGAC 

3151 GAG6GCAAGG TGATCCTGGT GGCTGTGCAT GTGGCCTCCG GCTACATTGA 
CTCCCGTTCC ACTAGGACCA CCGACACGTA CACCGGAGGC CGATGTAACT 

3201 GGCTGAGGTG ATCCCTGCTG AGACAGGCCA GGAGACTGCC TACTTCCTGC 
CCGACTCCAC TAGGGACGAC TCTGTCCGGT CCTCTGACGG ATGAAGGACG 

3251 TGAAGCTGGC TGGCAGGTGG CCTGTGAAGA CCATCCACAC TGCCAATGGC 
ACTTCGACCG ACCGTCCACC GGACACTTCT GGTAGGTGTG ACGGTTACCG 

3301 TCCAACTTCA CTGGGGCCAC AGTGAGGGCT GCCTGCTGGT 6GGCTGGCAT 
AGGTTGAAGT CACCCCG6TG TCACTCCCGA CGGACGACCA CCCGACCGTA 

3351 CAAGCAGGAG TTTGGCATCC CCTACAACCC CCAGTCCCAG GGGGTGGTGG 
GTTCGTCCTC AAACCGTAGG GGATGTTG6G GGTCAGGGTC CCCCACCACC 

3401 CCTCCATGAA CAAGGAGCTG AAGAAGATCA TTGGGCAGGT GAGGGACCAG 
GGAGGTACTT GTTCCTCGAC TTCTTCTAGT AACCCGTCCA CTCCCTGGTC 

3451 GCTGAGCACC TGAAGACAGC TGTGCAGATG GCTGTGTTCA TCCACAACTT 
CGACTCGTGG ACTTCTGTC6 ACACGTCTAC CGACACAAGT AGGTGTTGAA 

3501 CAAGAGGAAG GGGGGCATCG GGGGCTACTC CGCTGGGGAG AGGATT6TGG 
GTTCTCCTTC CCCCCCTAGC CCCCGATGAG OCGACCCCTC TCCTAACACC 

3551 ACATCATTGC CACAGACATC CAGACCAAGG AGCTCCA6AA GCAGATCACC 
TGTAGTAACG GTGTCTGTAG GTCTQGTTCC TCGAGGTCTT CGTCTAGTGG 

3601 AAGATCCAGA ACTTCAOOOT GTACTACAGG GACTCCA6GA ACCCCCTGTG 
TTCTAGGTCT TGAACTCCCA CATCATCTCC CTGAGGTCCT TGGGGGACAC 

3651 GAAGG6CCCT GCCAAGCTGC TGTGGAAOGG 6QAGGGG6CT GTGGTGATCC 
CTTCCCGGGA CGGTTC<SACG ACACCTTCCC CCTCCCCCGA- CACCACTAGG 

3701 AGGACAACTC TGACATCAAG GTGGTGCCCA GGAGGAAGGC CAAGATCATC 
TCCTGTTGAG ACTGTAGTTC CACCACOGGT CCTCCTTCCG GTTCTAGTAG 
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3751 



ACGGACTATG 




TCCCTGATAC ciiTC(3TCTA CCGACCCCTA CTGACACACC GGAGGl^T 

3801 GGATGAGGAC TAAAGCCCGG GCAGATCT6C TGTGCCTTCT AGTTGCCAGC 
CCTACTCCTG ATTTCGGGCC CGTCTAGACG ACACGGAAGA TCAACGGTCG 

3851 CATCTGTTGT TTGCCCCTCC CCCGTGCCrT CCTTGACCCT GGAAGGTGCC 
CTAGACAACA AACGGGGA6G GGGCACGGAA GGAACTGGGA CCTTCCACGG 

3901 ACTCCCACTG TCCTTTCCTA ATAAAATGAG GAAATTGCAT CGCATTGTCT 
T6AGGGTGAC AGGAAAGGAT TATTTTACTC CTTTAACGTA GCGTAACAGA 

3951 GAGTAGGTGT CATTCTATTC TGGGGGGTGG GGTGGGGCAG GACAGCAAGG 
CTCATCCACA GTAAGATAAG ACCCCCCACC CCACCCCGTC CTGTCGTTCC 

4001 GGGAGGATTG GGAAGACAAT AGCAGGCATG CTGGGGATGC GGTGGGCTCT 
CCCTCCTAAC CCTTCTGTTA TCGTCC6TAC GACCCCTACG CCACCCGA6A 

4051 ATGGCCGATC GGCGCGCCGT ACTGAAATGT GTGGGCGTGG CTTAAGGGTG 
TACCGGCTAG CCGCGCGGCA TGACTTTACA CACCCGCACC GAATTCCCAC 

4101 GGAAAGAATA TATAAGGTGG GGGTCTTATG TAGTTTTGTA TCTGTTTTGC 
CCTTTCTTAT ATATTCCACC CCCAGAATAC ATCAAAACAT AGACAAAACG 

4151 AGCAGCCGCX: CCCGCCATGA «CACCAACTC GTTTGATGGA AGCATTGTGA 
TCGTCGGCGG CGGCGGTACT CGTGGTTGAG CAAACTACCT TCGTAACACT 

4201 GCTCATATTT GACAACGCGC ATGCCCCCAT GGGCCGGGGT GCGTCAGAAT 
.CGAGTATAAA CTGTTGCGCG TACGGGGGTA CCCGGCCCCA CGCAGTCTTA 

4251 GTGATGGGCT CCAGCATTGA TGGTC6CCCC GTCCTGCCCG CAAACTCTAC 
— eACTAeCGGA-GGTGGTAACT-ACCAGCGGGG CAGGACGGGC-GTTTGAGATG„ 

4301 TACCTTGACC TACGAGACCG TGTCTGGAAC GCCGTTGGAG ACTGCAGCCT 
ATGGAACTGG ATGCTCT6GC ACAGACCTTG CGGCAACCTC TGACGTCGGA 

4351 cx:gccgccgc TTCAGCCGCT GCAGCCACCG CCCGCGGGAT TGTGACTGAC 
GGCGGCGGCG AAGTC6GCGA CGTCG3TGGC GGGCGCCCTA ACACTGACTG 

4401 OTTGCTTTCC TX5AGCCCGCT TGCAAACAGT GCAGCTTCCC GTTCATCCGC 
AAACGAAAGG ACTCGGCCGA ACGTTTGTCA CGTCGAAGGG CAAGTAGGCG 

4451 CCGCGATGAC AAGTTGACGG CTCTTTTGGC ACAATTGGAT TCTTTGACCC 
G6CGCTACTG WCAACTGCC 6AGAAAACCG TGTTAACCTA AGAAACTGGG 

4501 GGGAACTTAA TGTCGTTTCT CAGCAGCTGT TGGATCTGCG CCAGCAGGTT 
CCCTTGAATT ACAGCAAAGA GTCGTCGACA ACCTAGACGC GGTCGTCCAA 

4551 TCTGCCCTGA AGGCTTCCTC CCCTCCCAAT GCGGTTTAAA ACATAAATAA 
A6ACGGGACT TCCGAAGGAG GGGAQGGTTA CGCCAAATTT TGTATTTATT 

4501 AAAACCAGAC TCTGTTTGGA TTTGGATCAA GCAAGTGTCT TGCTGTCTTT 
TTTTGOTCTG AGACAAACCT AAACCTAGTT CGTTCACAGA ACGACAGAAA 

4651 ATTTAGGGGT TTTGCGCGCG CGGTAGGCCC GGGACCAGCG GTCTCGGTCG 
TAAATCCCCA AAACGCGCGC GCCATCCGGG CCCTGGTCGC CAGAGCCAGC 
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4701 
4751 
4801 
4851 
4901 

t 

4951 
5001 
5051 
5101 
5151 
5201 
5251 
5301 
5351 
5401 
5451 
5501 
5551 
5601 



TTGAGGGTCC T^^TATTTT TTCCAGGACG TGGTAAAO^JlHyiSCTC'J^^T 
AACTCCCA66 a|^ATAAAA AAGGTCCTGC ACCATTTCCA CTGAGA^^A 

GTTCAGATAC ATGGGCATAA GCCCGTCTCT GGGGTGGAGG TAGCACCACT 
CAAGTCTATG TACCCGTATT CG6GCAGAGA CCCCACCTCC ATCGTGGTGA 

GCAGAGCTTC ATGCTGCGGG GTGGTGTTGT AGATGATCCA GTCGTAGCAG 
CGTCTCGAAG TACGACGCCC CACCACAACA TCTACTAGGT CAGCATCGTC 

GACCGCTGGG CXTTGCTGCCT AAAAAITGTCT TTCAGTAGCA AGCTGATTGC 
CTCGCGACCC GCACCACGGA TTTTTACAGA AAGTCATCGT TCGACTAACG 

CAGGGGCAGG CCCTTGGTGT AAGTGTTTAC AAAQCGGTTA AGCTGGGATG 
GTCCCCGTCC GGGAACCACA TTCACAAATG TTTCGCCAAT TCQACCCTAC 

GGTGCATACG TGGGGATATG AGAT6CATCT TGGACTGTAT TTTTAGGTTG 
CCACGTATGC ACCCCTATAC TCTACGTAGA ACCTGACATA AAAATCCAAC 

GCTATGTTCC CAOCCATATC CCTCCGGGGA TTCATGTTGT GCAGAACCAC 
CGATACAAGG GTCGGTATA6 GGAGGCCCCT AAGTACAACA C6TCTTGGTG 

CAGCACAGTG TATCCGGTGC ACTTGGGAAA TTTGTCAT6T AGCTTAGAAG 
GTCGTGTCAC ATAGGCCACG TGAACCCTTT AAACAGTACA TCGAATCTTC 

GAAATGCGTG GAAGAACTTG GAGACGCCCT TOTQACCTCC AAGATTTTCC 
CTTTACGCAC CTTCTTGAAC CTCTGCGGGA ACACTGGAGG TTCTAAAAGG 

ATGCATTC6T CCATAATGAT GGCAAK^GGC CCACGGGC6G CGGCCTGGGC 
TACGTAAGCA GGTATTACTA CCGTTACCCG GGTGCCCGCC GCCGGACCCG 

GAAGATATTT CTGGGATCAC TAACGTCATA GTTGTGTTCC A6GATQAGAT 
CTTCTATAAA GACCCTACTG ATTCCAGTAT CAACACAA6G TCCTACTCTA 

CGTCATAGGC CATTTTTACA AACCCCGGGC GGAGGGTGCC AGACTGCGGT 
GCAGTATCCG GTAAAAATGT TTC6CGCCCG CCTCCCACQG TCTGACGCCA 

ATAATGGTTC CATCCGGCCC AGGGGCGTAG TTACCCTCAC AGATTTGCAT 
TATTACCAAG GTAGGCC6GG TCCCCGCATC AATGGGAGTG TCTAAACGTA 

TTCCCACGCT TTGAGTTCAG ATGGGGGGAT CATGTCTACC TGCGGGGCGA 
AAGGGTGCGA AACTCAAGTC TACCCCCCTA GTACAGATGG ACGCCCCGCT 

TGAAGAAAAC GGTTTCCGGG GTAGGGGAGA TCAGCTGGGA AGAAAGCAGG 
ACTTCTTTTG CCAAAGGCCC CATCCCCTCT AGTCGACCCT TCTTTCGTCC 

TTCCTGAGCA GCTGCGACTT ACCGCAGCCG GTGGGCCCGT AAATCACACC 
AAGGACTCGT CGACGCTGAA TGGCGTCGGC CACCCGGGCA TTTAGTGTGG 

TATTACCGGC TGCAACTGGT AGTTAAGAGA GCTGCAGCTG CCGTCATCCC 
/^AATGGCCG ACGTTGACCA TCAATTCTCT CGACGTCGAC GGCAGTAGGG 

TGAGCAGGGG GGCCACTTCG TTAAGCAT6T CCCTGACTCG CATGTTTTCC 
ACTCGTCCCC (XGGTGAAGC AATTCGTACA GGGACTGAGC 6TACAAAAGG 

CTGACCAAAT CCGCCAGAAG GCGCTCGCCG CCCAGCGATA GCAGTTCTTG 
GACTCGTTTA GGCGGTCTTC CGCGAGCGGC GGGTCGCTAT CGTCAAGAAC 
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5651 CAAGGAAGCA AlPfTTTTCA ACGGTTTGAG ACCGTCCGCC GTAGGC^C 
GTTCCTTCGT TTCAAAAAGT TGCCAftACTC TGGCAGGCCG CATCCGTACG 

5701 TTTTGAGCGT TTGACCAAGC AGTTCCAGGC GGTCCCACAG CTCGGTCACC 
AAAACTCGCA AACTGGTTCG TCAAGGTCCG CCAGGGTGTC GAGCCAGTGG 

5751 TGCTCTACGG CATCTCGATC CAGCATATCT CCTCGTTTCG CGGGTTGG6G 
ACGAGATGCC GTAGAGCTAG GTCGTATAGA GGA6CAAAGC GCCCAACCCC 

5B01 CGGCTTTCGC TGTACGGCAG TAGTCGGTGC TCGTCCAGAC GGGCCAGGGT 
GCCGAAAGCG ACATGCCGTC ATCAGCXACG AGCAGGTCTG CCCGGTCCCA 

5851 CATGTCTTTC CACGGGCGCA GGGTCCTCGT CAGCGTAGTC TGGGTCACGG 
GTACAGAAAG GTGCCCGCGT CCCAGGAGCA 6TCGCATCAG ACCCAGTGCC 

5901 TGAAGGGGTG CGCTCCGGGC TGCGCOCTOG CCAGGGTGCG CTTGAGGCTG 
ACTTCCCCAC GCGAGGCCC6 ACGCGCGACC G6TCCCACGC GAACTCCGAC 

5951 6TCCTGCTGG TGCTGAAGCG CTGCCGGTCT TCQCCCTGCG CGTC6GCCAG 
CAGGACGACC ACGACTTC6C GACGGCCAGA AGC6GGACGC GCAGCCGGTC 

6001 GTAGCATTTG ACCATG6TGT CATAGTCCAG CCCCTCCGC6 GCGTGGCCCT 
CATCGTAAAC TGGTACCACA GTATCA6GTC GGGGAGGCGC CGCACCGGGA 

6051 TGGCGCCCAG CTTGCCCTTG GAGGAGGCGC CGCACGAGGG 6CAGTGCAGA 
ACCGCGCGTC GAACGGGAAC CTCCTCCGCG GCGTGCTCCC CGTCACGTCT 

6101 CTTTTGAGGG CGTAGAGCTT GGGCGCGAGA AATACCGATT CCGGGGAGTA 
GAAAACTCCC GCATCTCGAA CCCGCGCTCT TTATGGCTAA 6GCCCCTCAT 

6151 GGCATCCGCG CCGCAGGCCC CGCAGACGGT CTCGCATTCC_ACGAGCCAGG 
CCGTAGGCGC GGCGTCCGGG GCGTCTGCCA GAGCGTAAGG TGCTCGGTCC 

6201 T6AGCTCTGG CCGTTCGGGG TCAAAAACCA 6GTTTCCCCC ATGCTTTTTG 
ACTC6A6ACC GGCAAGCCCC AGTTTTTGGT CCAAAGGGGG TACGAAAAAC 

6251 ATGCGTTTCT TACCTCTGGT TTCCATGAGC CGGTGTCCAC GCTCG6TGAC 
TACGCAAAGA ATGGAGACCA AAGGTACTCG GCCACAGGTG CGAGCCACTG 

6301 GAAAAGGCTG TCCGTGTCCC CGTATACAGA CTTGAGAGGC CTGTCCTCGA 
CTTTTCCGAC A6GCACAGGG GCATATGTCT GAACTCTCCG GACAGGAGCT 

6351 GCGGTGTTCC GCGGTCCTCC TCGTATAGAA ACTCGGACCA CTCTGAGACA 
CGCCACAAGG C6CCAGGAGG AGCATATCTT TGAGCCTGGT GAGACTCTGT 

6401 AAGGCTCGCG TCCAGGCCAG CACGAAGGAG GCTAAGTGGG AGGGGTAGCG 
TTCC6AGCGC A6GTCCGGTC GTGCTTCCTC CGATTCACCC TCCCCATCGC 

6451 6TCGTTGTCC ACTAGGGGGT CCACTCGCTC CAGGGTGTGA AGACACATGT 
CAGCAACAGG T6ATCCCCCA GGTGAGCGAG GTCCCACACT TCTGTGTACA 

6501 CGCCCTCTTC GGCATCAAGG AAGGTGATTG GTTTGTAGGT GTAGGCCACG 
GCGGGAGAAG CCGTAGTTCC TTCCACTAAC CAAACATCCA CATCCGGTGC 

6551 TGACCGGGTG TTCCTGAAGG GGGGCTATAA AAGGGGGTGG GGGCGCGTTC 
ACTGGCCCAC AAG6ACTTCC CCCCGATATT TTCCCCCACC CCCGCGCAAG 
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6601 GTCCTCACTC TCgTCCGCAT CGCTGTCTGC CACGGCCJVSa^GTSTQGB^ 
CAGGAGTGAG i^^GGCGTA GCGACAGACG CTCCCGGTCG ACAACQ^C 

6651 AGTACTCCCT CTGAAAAGCG GGCATGACTT CTGCGCTAAG ATTGTCAGTT 
TCATGAGGGA GACTTTTCGC CCGTACTGAA 6ACGCGATTC TAACA6TCAA 

6701 TCCAAAAACG AGGAGGATTT GATATTCACC TGGCCCGCGG TGATGCCTTT 
AGGTTTTTGC TCCTCCTAAA CTATAAGTGG ACCGGGCGCC ACTACGGAAA 

6751 GAGGGTGGCC GCATCCATCT GGTCAGAAAA GACAATCTTT TTGTTGTCAA 
CTCCCACCGG CGTAGGTAGA CCAGTCTTTT CTGTTAGAAA AACAACAGTT 

6801 GCTTGGTGGC AAACGACCCG TAGAGGGCGT TGGACAGCAA CTT6GCGATG 
CGAACCACCG TTTGCTGGGC ATCTCCCGCA ACCTGTCGTT GAACCGCTAC 

6851 GAGCGCAGGG TTTGCTTTTT GTCGCGATCG GCGCGCTCCT TGGCCGCGAT 
CTCGCGTCCC AAACCAAAAA CAGCGCTAGC CCCGCGAGGA ACCGGCGCTA 

6901 GTTTA0CT6C ACGTATTCGC GCGCAACQCA CCGCCArTCG GGAAA6ACGG 
CAAATCGACG TGCATAA6CG CGCGTTGOGT 6GCGGTAA6C CCTTTCTGCC 

€951 TGGTOCOCTC GTCGGGCACC AGGTGCACGC GCCAACCGCG GTTGTGCAGG 
ACCACGCQAG CAGCCCGTGG TCCACGTGCG CGGTTGGCGC CAACACGTCC 

7001 GrGACAAGGT CAACGCTGGT GGCTACCTCT CCCCGTACCC GCTCGTTGGT 
CACTGTTCCA 6TTGCGACCA CCGATG6AGA GQCGCATCCG CGAGCAACCA 

7051 CCAGCAGAQG CGGCCGCCCT TGCGCGAGCA GAATGGCGGT AGGGGGTCTA 
GGTCGTCTCC GCCGGCGGGA ACGCGCTCGT CTTACCGCCA TCCCCCAGAT 

7101 GCTGCGTCTC GTCCGGGGGG TCTGCGTCCA CGGTAAAGAC CCCGGGCAGC 
CGACGCAGAG CAGGCCCCCC AGACGCAGGT GCCATTTCTG GGGCCCGTCG 

7151 ACGCGCGCGT CGAAGTAGTC TATCTTGCAT CCTTGCAAGT CTA6CGCCTG 
TCCQCGCGCA GCTTCATCAG ATAGAACGTA GGAACGTTCA GATCGCGGAC 

7201 CTGCCATGCG OGGGCGGCAA GCGCGCGCTC GTATGGGTTG AGTGGGGGAC 
GACGGTACGC 3CCCC3CCGTT CGCGCGCGAG CATACCCAAC TCACCCCCTG 

7251 CCCATGGCAT GGGGTGGGTG AGCGCGGAGG CGTACATGCC GCAAATGTCG 
GGGTACCGTA CCCCACCCAC TC6CGCCTCC GCATGTACGC CGTTTACAGC 

7301 TAAACGTAGA GGGGCTCTCT GAGTATTCCA AGATATGTAG 6GTAGCATCT 
ATTTGCATCT CTCCGAGAGA CTCATAAGGT TCTATACATC CCATCGTAGA 

7351 TCCACCGCGG ATGCTOGCGC GCACGTAATC 6TATAGTTCG TGCGAGGGA6 
AGGTGGCGCC TACGACCGCG CGTGCATTAG CATATCAAQC ACGCTCCCTC 

7401 CGAGGAGGTC GGGACCGAGG TTGCTACG6G CGGGCTGCTC TGCTCGGAAG 
GCTCCTCCA6 CCCTGGCTCC AACGATGCCC GCCCGACGAG ACGAGCCTTC 

7fi51 ACTATCTCCC TGAAGATGGC AT6TGAGTTG GATGATATGG TTGGACGCTG 
TGATAGACGG ACTTCTACC6 TACACTCAAC CTACTATACC AACCTGCGAC 

7501 GAAGACGTTG AAGCTGGCGT CTGTGAGACC TACCGCGTCA CGCACGAAGG 
CTTCTGCAAC TTCGACCGCA GACACTCTGG ATGGCGCAGT GCGTGCTTCC 
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7551 AGGCGTAGGA G^CGCAGC TTGTTGAC'CA GCTCGGCGGT GACCTG 
TCCGCATCCT CAGCGCGTCG AACAACTGGT CGAGCCGCCA CTG6AC 

7601 TCTAGGGCGC AGTAGTCCAG GGTTTCCTTG ATGATGTCAT ACTTATCCTG 
AGATCCCGCG TCATCAGGTC CCAAAGGAAC TACTACAGTA TGAATAGGAC 

7651 TCCCTTTTTT TTCCACAGCT CGCGGTTGAG GACAAACTCT TCGCGGTCTT 
AGGGAAAAAA AAGGTGTCGA GCGCCAACTC CTGTTTGAGA AGCGCCAGAA 

7701 TCCAGTACTC TTOGATCGGA AACCCGTCGG CCTCCGAACC CTAAGACCCT 
AGGTCATGAG AACCTAGCCT TTGGSCAGCC GGAGGCTTGC CATTCTCG6A 

7751 AGCATGTAGA ACTGGTTGAC GGCCTGGTAG GCGCAGCATC -CCTTTTCTAC 
TCGTACATCT TGACCAACTG CCGGACCATC CGCGTCGTAG GGAAAAGATG 

7801 GGGTAGCGCG TATGCCTGCG CGGCOTTCCG C3AGCGAGGTG TGGCTGAGCG 
CCCATCGCGC ATACGGACGC GCCGGAAGGC CTCGCTCCAC ACCCACTCGC 

7851 CAAAGGTGTC CCTGACCATG ACTTTGAGGT ACTGGTATTT GAAGTCAGTG 
CTTTCCACAG GGACTGGTAC TGAAACTCCA TGACCATAAA CTTCAGTCAC 

7901 TCGTC6CATC CGCCCTGCTC CCA6AGCAAA AAGTCCGTGC GCTTTTTGGA 
AGCAGCGTAG GCGGGACGAG GGTCTCGTTT TTCAGGCACG CGAAAAACCT 

7951 ACGCGGATTT GGCAGGGCGA AGGTGACATC GTTGAAGAGT ATCTTTCCCG 
TQCGCCTAAA CCGTCCCGCT TCCACTGTAG CAACTTCTCA TAGAAAGG5C 

8001 CGC6AGGCAT AAAGTTCCGT GTGATGCGGA AGGGTCCCGG CACCTCGGAA 
6CGCTCCGTA rTTCAACGCA CACTACGCCT TCCCAGGGCC GTGGAGCCTT 

8051 CGGTTGTTAA TTACCTGGGC GGCGAGCACG ATCTCGTCAA AGCCGTTGAT 
GCCAACAATT AATGGACCCG CCGCTCGTGC TAGAGCAGTT TCGGCAACTA 

8101 GTTGTGGCCC ACAATGTAAA GTTCCAAGAA GCGCGGGATG CCCTTGATOG 
CAACACCGG6 TGTTACATTT CAAGGTTCTT CGCGCCCTAC GGGAACTACC 

8151 AAGGCAATTT TTTAAGTTCC TCGTAGGTGA GCTCTTCAGG GGAGCTGAGC 
rrCCGTTAAA AAATTCAAGG AGCATCCACT CGAGAAGTCC CCTCGACTCG 

8201 CCGTGCTCTG AAAGGGCCCA GTCTGCAAGA TGAGGGTTGG AAGCGACGAA 
GGCACGAGAC TTTCCCGGGT CAGACGTTCT ACTCCCAACC TTCGCTGCTT 

8251 TGAGCTCCAC AGGTCACGGG CCATTAGCAT TTGCAGGTGG TCGCGAAAGG 
ACTCGAGGTG TCCAGTGCCC GGTAATCGTA AACGTCCACC AQCGCTTTCC 

8301 TCCTAAACTG GCGACCTATG GCCATmTT CTG6GGTGAT GCAGTA6AAG 
AGGATTTGAC CGCTGGATAC OGGTAAAAAA GACCCCACTA CGTCATCTTC 

8351 GTAAGCGGGT CTTGTTCCCA GCG(3TCCCAT CCAAGGTTCG CQGCTAGGTC 
CATTCGCCCA GXACAAGGOT COCCAGGOTA GOTTOCAAGC GCCGATCCAG 

8401 TCGCGCGGCA GTCACTAGAG GCTCATCTTC GCCGAAC^ 

AGCGCGCCGT CACTGATCTC CGASTASAGG CGGCTTGAAG TACTGGTCGT 

8451 TGAAGGGCAC GA6CT6CTTC CCAAAGGCCC CCATCCAAGT ATAGGTCTCT 
ACTTCCCGT6 CTCGACGAAG GGTrTCCQGG GGTAGGTTCA TATCCAGAGA 
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8501 ACATCGTAGG l^AAAGAG ACGCTCGGTG CGAGGATOCG AGCCG;*^G 
TGTAGCATCC i^TCTTTCTC TGCGAGCCAC CCTCCTACGC TCGGCtJBcC 

8551 GAAGAACTGG ATCTCCCGCC ACCAATTGGA GGAGTGGCTA TTGATGTGGT 
CTTCTTGACC TAGAGGGCGG TGGTTAACCT CCTCACCGAT AACTACACCA 

8501 GAAAGTAGAA GTCCCTGCGA CGGGCCGAAC ACTCGTGCTG GCTTTTGTAA 
CTTTCATCTT CAGGGACGCT GCCCGCCTTC TGAGCACGAC CGAAAACATT 

8651 AAACGTGCGC AGTACTGGCA GCGCTGCACC GGCTGTACAT CCTGCACGAG 
TTTGCACGC6 ICATGACCGT CGCCACGTGC CCGACATGTA 06ACGTGCTC 

8701 GTTGACCTGA CGACCGCGCA CAAGGAA6CA GAGTGGGAAT TTGAGCCCCT 
CAACTGGACT GCTGGCGCGT CTTCCTTCCT CTCACCCTTA AACTCGGGGA 

8751 C6CCTGGCGG GTTTGGCTGG TGGTCTTCTA CTTCGGCTGC TTGTCCTTGA 
QCGGACCGCC CAAACCGACC ACCA6AA6AT GAAGCCGACG AACAGGAACT 

8801 CC6TCTGGCT GCTCGA6GGG AGTTACGGTG GATC66ACCA CCACGCCGCG 
GGCAGACCGA CGAGCTCCCC TCAATGCCAC CTAGCCTGGT GGTOCGGCGC 

8851 CGAGCCCAAA GTCCAGATGT CCGCGC6CGG CGGTCGGAGC TTGATCSACAA 
GCTCGGGTTT CAGGTCTACA GGCGC6CGCC GCCAGCCTCG AACTACTGTT- 

8901 CATCGCGCAG ATGGGAGCTG TCCATGGTCT GGAGCTCCCG CGGCGTCAGG 
GTAGCGCGTC TACCCTCGAC AGGTACCAGA CCTCGAGGGC GCCGCAGTCC 

8951 TCA6GCGGGA GCTCCTGCAG GTTTACCTCG CATAGACGGG TCAGGGCGCG 
AGTCCGCCCT CGAGGACGTC CAAATGGAGC GTATCTGCCC AGTCCCGCGC 

9001 GGCTAGATCC AGGTGATACC TAATTTCCAG GGGCTGGTTG GTGGCGGCGT 
CCGATCTAGG TCCACTATGG ATTAAAGGTC CCCGACCAAC CACCGCCGCA 

9051 CGATGGCTTG CAAGAGGCCG CATCCCCGCG GCGCGACTAC GGTACCGCGC 
GCTACCGAAC GTTCTCCGGC GTAGGGGCGC CGCGCTGATG CCATGGCGCG 

9101 GGCGGGCGGT GGGCCGCGGG GGTGTCCTTG GATGATGCAT CTAAAAGCGG 
CCGCCCGCCA CCCGGCGCCC CCACAGGAAC CTACTACGTA GATTTTCGCC 

9151 TGACGCGGGC GAGCCCCCGG AGGTA6GGGG GGCTCCGGAC CCGCCG6GAG 
ACTGCGCCCG CTCGGGGGCC TCCATCCCCC CCGAGGCCTG GGCGGCCCTC 

9201 AGGGGGCAGG GGCACGTCGG CGCCGCGCGC GGGCAGGAGC TGGTGCTGCG 
TCCCCCGTCC CCGTOCAGCC GCGGCGCGCG CCCGTCCTCG ACCACGACGC 

9251 CGCGTAGGTT GCTGGCGAAC GCGACGACGC GGCGGTTGAT CTCCTGAATC 
GCGCATCCAA CGACCGCTTG CGCTGCTGCG CCGCCAACTA GAG6ACTTAG 

9301 TGGCGCCTCT GCGTGAAGAC GACGGGCCCG GTGAGCTT6A ACCTGAAAQA 
ACCGCGGAGA CGCACTTCTG CTGCCCGGGC CACTCGAACT TCGACTTTCT 

9351 GAGTTCGACA GAATCAATTT CGGTGTCGTT GACGQCGGCC TGOCGCAAAA 
CTCAAGCTGT CTTAGTTAAA GCXa^CAQCAA CTGCCGCCGG ACCGCGTTTT 

9401 TCTCCTGCAC GTCTCCTGAG TTGTCTTGAT AGGCGATCTC GGCCATGAAC 
AGAGGACGTG CAGAGGACTC AACAGAACTA TCCGCTAGAG CCGGTACTTC 
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9451 TGCTCGATCT C^CTCCTG GAGATCICCG COTCCGGCTC °CTCCAM|T 
ACGAGCTAGA G«GGAGGAC CTCTAGAGGC GCAGGCCGAG CGAGGTgCcA 

9501 GGCGGCGAGG TCGTTGGAAA TGCGGGCCAT 6AGCTGCGAG AAGGCGTTGA 

ccgccgctcc agcaaccttt acgcccgota ctcgacgctc ttccgcaact 

955X ggcctccctc gttccagacg cggctgtaga ccacgccccc ttcggcatcg 
ccggagggag caaggtctgc gccgacatct ggtocogcgg aagccgtagc 

9601 c6ggcgcgca tgaccacctg cgcgagattc agctccacgt gccgggcgaa 
gcccgcgcgt actggtggac gcgctctaac tcgagctoca cggcccgctt 

9651 g7kc6gcgta6 tttcgcaggc gctcajagag gftagttgagg gtggtggcgg 
ctgccgcatc aaagcgtccg cgactttctc catcaactcc caccaccgcc 

9701 tgtgttctgc cacgaagaag tacataaccc agcgtcgcaa cgtggattcg 
acacaagacg gtgcttcttc atgtattggo tcgcagcgtt gcacctaagc 

9751 ttgatatccc ccaaggcctc aaggcgctcc atggcctcgt agaagtccac 
aactataggg ggttccggag ttccgcgagg taccggagca tcttcaggtg 

9B01 ggcgaagttg aaaaactggg agttgcgcgc cgacacggtt aactcctcct 

CCGCTTCAAC nTTTGACCC TCAACGCGCG GCTGTGCCAA TTGAGGAGGA 

9851 CCAGAAGACG GATGAGCTCG GCGACAGTGr CGCGCACCTC GCGCTCAAAG 
GGTCTTCTGC CTACTCGAGC CGCTGTCACA GCGCGTGGAG CGCGAGTTTC 

9901 GCTACAGGGG CCTCTTCTTC TTCTTCAATC TCCTCTTCCA TAAGGGCCTC 
CGATGTCCCC GGAGAAGAAG AA6AAGTTAG AGGAGAAGGT ATTCCCGGA6 

9951 CCC'TCTTC'r TCTTCTGGCG GCGGTGGGGG AGGGGGGACA CGGCGGCGAC 
GGG^GAAGA AGAAGACCGC CGCCACCCCC TCCCCCCTGT GCCGCCGCTQ- 

10001 GACGGCGCAC CGGGAGGCGG TCGACAAAGC GCTCGATCAT CTCCCCGCG6 
CTGCCGCGTG GCCCTCCGCC AGCTGTTTCG CGAGCTAGTA GAGGGGCGCC 

10051 CGACGGCGCA TGGTCTCGGT GACGGCGCGG CCGTTCTCGC GGGGGC6CAG 
GCTGCCGCGT ACCAGA6CCA CTGCCGCGCC GGCAAGA6CG CCCCCGCGTC 

10101 TTGGAAGACG CCGCCCGTCA TGTCCCG6TT ATGGGTTCCC CGGGGGCTGC 
AACCTTCTGC GGCGGGCAGT ACAGGGCCAA TACCCAACCG CCCCCCGACG 

10151 CATGCGGCAG GGATACGGCG CTAACGATGC ATCTCAACAA TTGTTGTGTA 
GTACGCCGTC CCTATGCCGC GATTGCTACG TAa&GTTGTr AACAACACAT 

10201 GGTACTCCGC CGCCGAGGGA CCTGAGCGAG TCCGCATCGA CCGGATCGGA 
CCATGAGGCG GCGGCTCCCT GGACTCQCTC AGGCGTAGCT GQCCTAQCCT 

10251 AAACCTCTCG AGAAAGGCGT CTAACCAGTC ACAGTCGCAA GGTAGGCTQA 
TTTGGAGAGC TCTTTCCGCA GATT66TCAG TCTCAGCGTT CCATCCGACT 

10301 6CACCGTGGC OGGCOGCAGC GGGOGGCGGT CGGGGTTGTT TCTGGCGGAG 
CGTGGCACCG CCCGCCGTCO CCCGCCGCCA GCOCCAACAA AGACCGCCTC 

10351 GTGCTGCTGA TCATCTAATT AAAGTAGQC6 GTCTTGAGAC GGCGGATGGT 
CACGACGACT ACTACATTAA TTTCATCCGC CAGAACTCTG CCCCCTACCA 
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1040X CGACAGT^GC 



TGTCCT 



TGGGTCCQGC 



CTGCTGAA?rG"t:(5dAGi 



10451 
10501 
10551 
10601 
106S1 
10701 
10751 
10801 
10851 
10901 
10951 
11001 
11051 
11101 
11151 
11201 
11251 
11301 



GCTGTCTTCG tIBaCAGGA ACCCAGGCCG GACGACTTAC GCGTCCSRA 

CGGCCATGCC CCAGGCTTCG TTTTGACATC GGCGCAGGTC TTTGTAGTAG 
GCCGGTACGG G&TCC6AAGC AAAACTGTAG CC6CGTCCAG AAACATCATC 

TCTTGCATGA GCCTTTCTAC C6GCACTTCT TCTTCTCCTT CCTCTTGTCC 
A6AACGTACT CGGAAAGAT6 GCCGTGAAGA AGAAGAGGAA GGAGAACAGG 

TGCATCTCTT GCATCTATCG CTGCGGCGGC GGCGGAGTTT GGCCGTAGGT 
ACGTAGAGAA CGTA6ATAGC GACGCCGCCG CCGCCTCAAA CCGGCATCCA 

GGCGCCCTCT TCCTCCCATG CGTGTGACCC CGAAGCCCCT CATCGGCTGA 
CCGCGGGAGA AGGAGGGTAC GCACAC3XSGG GCTTCGGGGA GTAGCCGACT 

AGCAGGGCTA G3TCGGCGAC AACGCGCTCG GCTAATATGG CCTGCTGCAC 
TCGTCCCGAT CCAGCCGCTQ TT6CGCQAGC CGATTATACC GGACGACGTG 

CTGCGTGAGG GTAGACTGGA AGTCATCCAT GTCCACAAAG CGGTGGTATG 
GACGCACTCC CATCTGACCT TCAGTAGGTA CAGGTGTTTC GCCACCATAC 

CGCCCGTGTT 6ATGGTGTAA GTGCAGTTGG CCATAACGGA CCAGTTAACG 
GCGGGCACAA CTACCACATT CACGTCAACC GGTATTGCCT GGTCAATTGC 

GTCTGGTGAC CCGGCTGCGA GAGCTCGGTG TACCTGAGAC G.CGAGTAAGC 
CAGACCACTG GGCCGACGCT CTCGAQCCAC ATGGACTCT6 CGCTCATTCG 

CCTCGAGTCA AATACGTAGT CGTTGCAAGT CCGCACCAGG TACTGGTATC 
GGAGCTCAGT TTATGCATCA GCAACGTTCA GGCGTGGTCC ATGACCATAG 

CCACCAAAAA GTGCGGCGGC GGCTGGCG6T AGAGGGGCCA GCGTAGGGTG 
GGTGGTTTTT CACGCCGCCG CCGACCGCCA TCTCCCC6CT CCCATCCCAC 

GCCGGGGCTC OGGGGGCGAG ATCTTCCAAC ATAAG6CGAT GATATCCGTA 
CGGCCCCGAG GCCCCCGCTC TAGAAGGTTG TATTCCGCTA CTATAGGCAT 

GATGTACCTG GACATCCAGG TGATGCC6GC GGCGGTGGrG GAGGCGCGCG 
CTACATGGAC CTGTAGGTCC ACTACGGCCG CCGCCACCAC CTCCGCGCGC 

GAAAGTCGCG GACGCGGTTC CAGATGTT6C GCAGCGGCAA AAAGTGCTCC 
CTTTCAGC6C CTGCQCCAAG GTCTACAACG CGTCGCCQTO TTTCACGAGG 

ATGGTCGGGA CGCTCTGGCC GGTCAGGCGC GCGCAATCGT TGACGCTCTA 
TACCAGCCCT GC6A6ACCGG CCAGTCCGCG CGCGTTAGCA ACTGCGAGAT 

GACCGTCCAA AASGAGAGCC TGTAAGCGOG CACTCTTCCG TGGTCTGGTG 
CTGOCACGTT TTCCTCTCGG ACATTCGCCC 6TGAGAAGGC ACCAGACCAC 

GATAAATTC6 CAAGGGTATC AT6GCGGACG ACCGGGGTTC GAGCCCCGTA 
CTATTTAAQC GTTCCCATAG TACOGCCTOC TGGCCCCAAG CTCGGGGCAT 

TCCGGCCGTC CGCCGTGATC CATGCG6TTA CXTGCCCGCGT GTCGAACCCA 
A6GCCGGCAG GCGGCATTAG 6TACGCCAAT GGCGGGCGCA CAGCTTGGGT 

GGTGTGCGAC CTCAGACAAC GGGGGA6TGC TCCTTTTGGC TTCCTTCCAG 
CXACACGCTG CAGTCTGTTG CCCCCTCACG AGGAAAACCG AAGGAAGGTC 
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11351 
11401 
11451 
11501 
11551 
11601 
11651 
11701 
11751 
11801 
11851 
llSOl 
11951 
12001 
12051 
12101 
12151 
12201 
12251 



CGCOCCGCCG aW^cgcgat cgaaaaa&cc ggtgaccggc gcgcgtWCa 

AAGCGGTTAG GCTGGAAAGC GAAAGCATTA AGTGGCTCGC TCCCTGTAGC 
TTCGCCAATC CGACCTTTCG CTTTCGTAAT TCACCGAGCG AGGGACATCG 

CGGAGGGTTA TTTTCCAAGG GTTGAGTCGC GGGACCCCCG GTTCGAGTCT 
GCCTCCCAAT AAAAGGTTCC CAACTCAGCG CCCTGGGGGC CAAGCTCAGA 

CGGACCGGCC GGACTGCGGC 6AACGGGGGT TTGCCTCCCC GTCATGCAAG 
GCCTGGCCGG CCTGACGCCG CTTGCCCCCA AAC6GAGGGG CAGTACGTTC 

ACCCCGCTTG CAAATTCCTC CGGAAACAGG GACGAGCCCC TTTTTTGCTT 
TGGGGCGAAC GTTTAA6GAG GCCTTTGTCC CTGCTCGGGG AAAAAACGAA 

TTCCCAGATG CATCCGGTGC TGCGGCAGAT GCGCCCCCCT CCTCAGCAGC 
AAGGGTCTAC GTAGGCCACC ACGCCGTCTA CGCGGGGGGA GGAGTCGTCG 

GGCAAGAGCA AGAGCAGCGG CAGACATGCA GGGCACCCTC CCCTCCTCCT 
CCGTTCTCGT TCTCGTCGCC GTCTGTACGT CCCGTGGGAG GQGAGGAGGA 

ACCGCGTCAG GAGGGGCGAC ATCCGCGGTT GACGCGGCAG CAGATGGTGA 
TGGCGCAGTC CTCCCCGCTG TACOCGCCAA CTGCGCCGTC GTCTACCACT 

TTACGAACCC CCGCG6CGCC GGGCCCGGCA CTACCT6GAC TTGGAGGAGG 
AATGCTTGGG GGCGCCGCGG CCCGGGCCGT GATGGACCTG AACCTCCTCC 

GCGAGGGCCT G6CGCGGCTA GGAGCGCCCT CTCCTGAGCG GCACCCAAGG 
CGCTCCCGGA CCGCGCCGAT CCTCGCGGGA GAG6ACTCGC CGTGGGTTCC 

GTGCA6CTGA AGCGTGATAC GCGTGAGQCG TACGTGCC6C GGCAGAACCT 
CACGTCGACT TCGCACTATG-CGCAGTCOGC- ATGCACGGCG CCGTCTTGGA- 

6TTTCGCGAC CGCGAGGGAG AGGAGCCCGA GGAGATGCGG GATC6AAAGT 
CAAAGCGCTG GCGCTCCCTC TCCTCGGGCT CCTCTACGCC CTA6CTTTCA 

TCCACGCAGG GCGCGAGCTG CGGCATGGCC TGAATCGCGA GCGGTTGCTG 
AGGTGCGTCC CGCXSCTCGAC GCCGTACCGG ACTTAGCGCT CGCCAACGAC 

CGCGAGGAGG ACTTTGAGCC CGACGCGCGA ACCG6GATTA GTCCCGCGCG 
GCGCTCCTCC TGAAACTCGG GCTGCGCGCT TGGCCCTAAT CAGGGCGCGC 

CGCACACGTG GCGGCC6CCG ACCTGGTAAC CGCATACGAG CAGACGGTGA 
GCGTGTGCAC CGCCGGCGGC TGGACCATTG GCGTATGCTC GTCTGCCACT 

ACCAOGAGAT TAACTTTCAA AAAAGCTTTA ACAACCACGT GCGTACGCTT 
TGGTCCTCTA ATTGAAAGTT TTTTCGAAAT TCTTG6TGCA CGCATGCGAA 

6TGGCGCGCG AGGAGGTGGC TATAGGACTG ATGCATCTGT GGGACTTTGT 
CACCGCGCGC TCCTCCACCG ATATCCTGAC TACGTAGACA CCCTGAAACA 

AAGCGCGCTG GAGCAAAACC CAAATAGCAA GCCGCTCATG GCGCAGCTGT 
TTCGCGCGAC CTCGTTTTGG GTTTATCGTT - CGGC6AGTAC CGCGTCGACA 

TCCTTATAGT GCAGCACAGC AGGGACAACG AGGCATTCAG GGATGCGCTG 
AGGAATATCA CGTCGTGTCG TCCCTGTTGC TCCGTAAGTC CCTACGCGAC 
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12301 CTAAACATAG T^GCCCGA GGGCCGCTGG CTGCTCGATT TCATAAMT 
GATTTGTATC ATOCGGGCT CCCGGCGACC GACGAOCTAA ACTATTOTTA 

12351 CCTCCAGAGC ATAGTGGTGC AGGAGCGCAG CTTGA6CCTG QCTGACAAGG 
GGACGTCTCG TATCACCACG TCCTCGCGTC GAACTCGGAC CGACTGTTCC 

12401 TGGCCGCCAT CAACTATTCC ATQCTTAGCC TGGGCAAGTT TTACCCCCGC 
ACCCGCCCTA 6TTCATAA6G TACGAATCG6 ACCCGTTCAA AATGCGGGCG 

12451 AAGATATACC ATACCCCTTA CGTTCCCATA GACAAGGAGG TAAAGATCGA 
TTCTATATGG TATGGGGAAT OCAAGGSTAT CTGTTCCTCC ATTTCTAQCT 

12501 CQ6GTTCTAC ATGCOCATGG CGCTGAAGGT GCTTACCTTG AdCGACGACC 
CCCCAA6ATG TACGCGTACC 6CGACTTCCA CGAATGGAAC TCGCTGCTGG 

12551 TCCCCGTTTA TCGCAACGAG CGCTITCCACA AGGCCGTGAG CGTGAGCCGG 
ACCCGCAAAT AGC6TTGCTC GCGTAGGTGT TCCGCCACTC GCACTCGGCC 

12601 CGGCGC6AGC 1CAGC6ACC0 CGAGCTGAT6 CACAGCCTGC AAAGGGCCCT 
GCCGCGCTCG AGTC6CTGGC GCTCGACTAC 6TGTCGGACG TTTCCCG6GA 

12651 G6CTGGCACG GGCAGCGGCG ATAGAGAGGC CGAGTCCTAC TTTGACGCGG 
CCGACCGTGC CCGTCGCCOC TATCTCTCCG GCTCAGGATG AAACTGCGCC 

12701 6CGCTGACCT GCGCTGGGCC CCAAGCCGAC GCGCCCTGGA GGCAGCTGGG 
CCCGACTGGA CGCGACCCGG GGTTCGGCTG CGCGGGACCT CCGTCGACCC 

12751 GCCGGACCTG GGCTGGCGGT GGCACCCGCG CGCGCTGGCA AC6TCGGCGG 
CGGCCTGGAC CCGACCGCCA CCGTGGGCGC GCGCGACCGT TGCAGCCGCC 

12801 CGTGGAGGAA TATGACGAGG ACGATGAGTA CGAGCCAGAG GACGGCGAGT 
GCACCTCCTT ATACTGCTCC TGCTACTCAT GCTCGGTCTC CTGCCGCTCA 

12851 ACTAAGCGGT GATGTTTCTG ATCAGATGAT GCAAGACGCA ACGGACCCGG 
TGATTCGCCA CTACAAAGAC TAGTCTACTA CGTTCTGCGT TGCCTGGGCC 

12901 CGGTGCGGGC GGCGCTGCAG AGCCAGCCGT CCGGCCTTAA CTCCACGGAC 
GCCACGCCCG CCGCGACGTC TCGGTCGGCA GGCCGGAATT GAGGTGCCTG 

12951 GACTGGCGCC AGGTCATCCA CCGCATCATG TCGCTGACTG CGCGCAATCC 
■ CTGACCGCGG TCCAGTACCT GGCGTAGTAC AGCGACTGAC GCGCGTTAGG 

13001 TCACGCGTTC CGGCAGCAGC CGCAGGCCAA CCGGCTCTCC GCAATTCT6G 
ACTGCGCAAG GCCGTCGTCG GCGTCCGGTT GGCC6AGAGG CGTTAAGACC 

13051 AAGCGGTGGT CCCGGCGCGC GCAAACCCCA C6CACGAGAA GGTGCTGGCG 
TTCGCCACCA GGGCCGC6CG C6TTTGGGGT GCOTGCTCTO CCACGACCGC 

13101 ATCGTAAACG CGCTGGCCGA AAACAGGGCC ATCCGGCCC6 AOGAGGCCGG 
TAGCATTTGC GC6ACCGGCT TTTOTCCCGC TAGGCXTGGQC TQCTCCQGCC 

13151 CCTGGTCTAC GACGCGCT6C TTCACC6CGT GGCTCGTTAC AACAGCGGCA 
GGACCAGATG CTGC6CGACG AAOTCCCGCA CCGAGCAATG TTGTC6CCGT 

13201 ACGTGCAGAC CAACCTGGAC CGGCTG6TGG 6GGAT6TGCQ CGAGGCCGTG 
TGCACGTCTG GTTGGACCTC CCCGACCACC CCCTACACGC GCTCCGGCAC 



68/144 



wo 02/22080 



PCT/USOl/28861 



13251 



GCGCAGC6TG '^^^ 
CGCGTCGCAC tcK' 



ICGCGCA 6CAGCAGGGC 
GCGCGT CGTCGTCCCG 



AACCTGGGCT CCkTQ 
TTGG;^CCC6& GGTACi 




13301 ACTAAACGCC TTCCTGAGTA CACAGCCX:6C CAACGTGCCG CG6GGACA6G 
TGATTTGCGG AftGGACTCAT GTGTCGSGCG GTTGCACGGC GCCCCTGTCC 

13351 AGGACTACAC CAACTTTQTG AGCGCACT6C GGCTAATGGT GACTGAGACA 
. TCCTGATGTG GTTGAAACAC TCGCGTGACG CCGATTACCA CTGACTCTGT 

13401 CCGCAAAGTG AGGTGTACCA GTCTGGGCCA GACTATTTTT TCCAGACCAG 
GGCGTTTCAC ICCACATGGT CAGACCCGGT CTGATAAAAA AGGTCTGGTC 

13451 TACACAAGGC CTGCAGACCG TAAACCTGAG CCAGGCTTTC AAAAACTTGC 
ATCTGTTCC6 GACGTCTGGC ATTTGGACTC GGTCC6AAAG TTTTTGAACG 

13501 AGGGGCTGTG QGGGGTGCGG GCTCCCACAG GCGACCGCGC GACCGTGTCT 
TCCCCGACAC CCCCCACGCC CGAGGGTCTC CCCTOGCGCG CTGGCACAGA 

13551 AGCTTGCTGA CGCCCAACTC GCGCCTGTTG CTGCTGCTAA TAGCGCCCTT 
TCGAACGACT GCGGGTTGAG CGC6GACAAC GACGACGATT ATCGCGGGAA 

13601 CACGGACAGT GGCAGCGTGT CCCGGGACAC ATACCTAGGT CACTTGCTGA 
GTGCCTGTCA CCGTCGCACA GGGCCCTGTG TATGGATCCA GTGAACGACT 

13651 CACTGTACCG CGAGGCCATA GGTCAGGCGC ATGTGGACGA GCATACTTTC 
GTGACATGGC GCTCCG6TAT CCAGTCCGCG TACACCTGCT CGTATGAAAG 

13701 CAGGAGATTA CAAGTGTCAG CCGCGCGCTG GGGCAGGASG ACACGGGCAG 
GTCCTCTAAT GTTCACAGTC GGCGCGCGAC CCCGTCCTCC TGTGCCCGTC 

13751 CCTGGAGGCA ACCCTAAACT ACCTGCTGAC CAACCGGCGG CAGAAGATCC 

— GGACCTCCGT TGGGATTTGA-TGGACGACTG GTTGGCCGCC GTCTTCTAGG - ~ 

13801 CCTCGTTGCA CAGTTTAAAC AGCGAGGAGG AGCGCATTTT GCGCTACGTG 
GGAGCAAC6T GTCAAATTTG TCGCTCCTCC TCGCGTAAAA CQCGATGCAC 

13851 CAGCAGAGCG TGAGCCTTAA CCTGATGCGC GACGGGGTAA CGCCCAGCGT 
GTCGTCTCGC ACTCGGAATT GGACTACGC6 CTGCCCCATT GCGGGTCGCA 

13901 GGCGCTG6AC ATGACCGCGC GCAACATGGA ACCGGGCATG TATGCCTCAA 
CCGCGACCTG TACTGGC6CG CGTTGTACCT TGGCCCGTAC ATACGGA6TT 

13951 ACCGGCCGTT TATCAACCGC CTAATGGACT ACTTGCATCG CGCG6CC6CC 
TGGCC6GCAA ATAGTTGGCG 6ATTACCTGA T6AACGTAGC GCGCCG6CGG 

14001 GTGAACCCCG A6TATTTCAC CAA-TGCCATC TTGAACCCGC ACTGGCTACC 
CACTTGGGGC TCATAAAGTG CflACGGTAG AT^CTTGGGCG TGACC6ATGG 

14051 GCCCCCTGGT TTCTACACCG G0Q6ATTCGA GGTGCCCGAG GGTAACGATG 
C6GGGGACCA AAGATGTGGC CCCCTAAGCT CCACGGGCTC CCATTGCTAC 

14101 GATTCCTCTG GGaCGACATA GACGACAGCG TGTTTTCCCC GCAACCGCAG 
CTAAGGAGAC CCTGCTGTAT CTGCTGICGC ACAAAAGGGG CGTTGGCGTC 

14151 ACCCTGCTAG ASTTGCAACA GCGCGAGCAG GCAGAGGCGG CGCTGCGAAA 
TGGGACGATC TCAACGTTGT CCCGCTCGTC CGTCTCCGCC GCGACGCTTT 
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14201 

14251 

14301 

14351 

14401 

14451 

14501 

14551 

14601 

14651 

14701 

14751 

14801 

14851 

14901 

14951 

15001 

15051 

151D1 



GGAAAGCTTC' C^j^GGCCAA GCAGCTTGTC CGATCTAGt5Cr--t3»GC&|j|^C 
CCTTTCGAAO qBtCCGGTT CGTCGAACAG : GCTAGATCCG CGACGC^Pg 

CGCG6TCA6A T6CTAGTAGC CCATTTCCAA GCTTGATAGG GTCTCTTACC 
GCGCCA6TCT ACGATCATC6 GGTAAAGGTT CGAACTATCC CAGAGAATGG 

AGCACTCGCA CCACCCGCCC GCGCCTGCTG GGCGAGGAGG AGTACCTAAA 
TCGTGAGCGT GGTGGGCGGG CGCGGACGAC CCGCTCCTCC TCATGGATTT 

CAACTCGCTG CTGCAGCCGC AGCGCGAAAA AAACCTGCCT CCGGCATTTC 
GTTGAGCGAC GACGTCGGCG TCGCGCTTTT TTTGGACGGA GGCCGTAAAG 

CCAACAACGG GATAGAGAGC CTAGTGGACA AGATGAGTAG ATGGAAGACG 

GGTTGTTGCC CTATCTCTCG GATCACCTGT tctactcatc taccttctgc 

TACGCGCAGG AGCACAGGGA CGTGCCAGGC CCGCGCCCGC CCACCCGTCG 
ATGCGCGTCC TCGTGTCCCT GCACGGTCCG GGCCCGGGCG GGTGGGCAGC 

TCAAAGGCAC GACCGTCAGC GGGGTCTGGT GTGGGAGGAC GATGACTCGG 
AGTTTCCGTG CTGGCAGTCG CCCCAGACCA CACCCTCCTG CTACTGAGCC 

CAGACGACAG CAGCGTCCTG GATTTGGGAG GGAGTGGCAA CCCGTTTGCG 
GTCTGCTGTC GTCGCAGGAC CTAAACCCTC CCTCACCGTT GGGCAAACGC 

CACCTTCGCC CCAGGCTGGG GAGAATGTTT TAAAAAAAAA AAAAGCATGA 
GTGGAAGCGG GGTCCGACCC CTCTTACAAA ATTTTTTTTT TTTTCGTACT 

TGCAAAATAA AAAACTCACC AAGGCCAT6G CACCGAGCGT TGGTTTTCTT 
ACGTTTTATT TTTTGAGTGG TTCCGGTACC GTGGCTCGCA ACCAAAAGAA 

GTATTCCCCT TAGTATGCGG C6CGC6GCGA TGTATGAGGA AGGTCCTCCT 
CATAAGGGGA ATCATACGCC GCGC6CCGCT ACATACTCCT TCCAGGAGGA 

CCCTCCTACG AGAGTGTGGT GAGC6CGGCG CCAGTGGCOQ CGGCGCTGGG 
GGGAGGATGC TCTCACACCA CTCGCGCCGC GGTCACCGCC GCCGCGACCC 

TTCTCCCTTC GATGCTCCCC TGGACCCGCC GTTTGTGCCT CCGCX3GTACC 
AAGAGGGAAG CTACGAGGGG ACCTGGGCGG CAAACACOGA GGCGCCATGG 

TGCGGCCTAC CGGGGGGAGA AACAGCATCC GTTACTCTGA GTTQGCACCC 
ACGCCGGATG GCCCCCCTCT TTGTCGTAGG CAATGA6ACT CAACCGTGGG 

CTATTCGACA CCACCCGTGT GTACCTGGTG GACAACAAGT CAACGGATGT 
GATAAGCTGT GGTGGGCACA CATGGACCAC CTGTTGTTCA GTTGCCTACA 

G6CATCCCTG AACTACCAGA ACGACCACAC CAACTTTCTG ACCACGGTCA 
CCGTAG66AC TTGATGGTCT TGCTGGTOTC 6TTGAAAGAC T0GTGCCAGT 

TTCAAAACAA TQACTACAOC CCGGGSGAGG CAAGCACACA GACCATCAAT 
AAGTTTTGTT ACTGATGTCG GGCCCCCTCC GTTCGTGTGT CTGQTAGTTA 

CTTGACGACC GGTCGCACTG GGGCGSCGAC CT6AAAACCA TCCTGCATAC 
GAACTGCTG6 CCAGCGTGAC CCCGCPGCTG GACTTTTGGT AGGACGTATG 

CAACATGCCA AATGTGAACG AGTTCATGTT TACCAATAAG TTTAAGGCGC 
GTTGTACGGT TTACACTTGC TCAAGTACAA ATGGTTATTC AAATTCCGCG 




70/144 



wo 02/22080 



15151 GGGTGATGGT d|^CGCTTG CCTACTAA6G ACAATCAGGT GGAGCl^piA 
CCCACTACCA CAGCGCGAAC GGATGATTCC TGTTAGTCCA CCTCGACTTT 

15201 TACGAGTGGG TGGAGTTCAC GCT6CCCGAG GGCAACTACT CCGAGACCAT 
ATGCTCACCC ACCTCAAGTG CGACGGGCTC CCGTTGATGA GGCTCTGGTA 

15252 GACCATAGAC CTTATGAACA AC6C6ATC6T 6GA6CACTAC TT6AAAGTGG 
CTGGTATCTG GAATACTTGT TGCGCTAGCA CCTCGTGATG AACTTTCACC 

15301 GCAGACAGAA CGGGG1TCT6 6AAAGCGACA TCGG6GTAAA GTTTGACACC 
CGTCTGTCTT GCXTCCAAGAC CTTTCGCTGT AGCCCCATTT CAAACTGTGG 

15351 CGCAACTTCA GACT666GTT T6ACCCC6TC ACTG6TCTT6 TCATGCCTGG 
GC6TTGAAGT CTGACCCCAA ACTGGGGCAG T6ACCAGAAC AGTACGGACC 

15401 GGTATATACA AAC6AAGCCT TCCATCCAGA CATCATTTT6 CTGCCAGGAT 
CCATATAT6T TTCCTTCGGA AGGTAGGTCT 6TAGTAAAAC GACGGTCCTA 

15451 GCGGGGTGGA CTTCACCCAC AGCCGCCTGA GCAACTTGTT G6GCATCCGC 
CGCCCCACCT GAAGTG6GT6 TCGGCGGACT CGTT6AACAA CCCGTAGGCG 

15501 AA6CGGCAAC CCTTCCAGGA GGGCTTTAGG ATCACCTACG ATGATCTGGA 
TTCGCCGTTG G6AAGGTCCT CCCGAAATCC TAGTGGATGC TACTAGACCT 

15551 GGGTGGTAAC ATTCCCGCAC TGTTGGATGT GGACGCCTAC CAGGCGAGCT 
CXCACCATTG TAAGGGCGT6 ACAACCTACA CCTGCGGATG GTCCGCTCGA 

15601 T<3AAAGATGA CACCGAACAG GGCGGGGGTG GCGCAGGCGG CAGCAACAGC 
- ACTTTCTACr GTGGCTTGTC CCGCCCCCAC CGCGTCCGCC GTC6TTGTCG 

.15651_ AGTGGC AGCG GCGC G GAAGA GAACT CCAAC GCG GCA GCCG CGGCAATGCA 
TCACCGTCGC CGCGCCTTCT CTTGAGGTTG CGCCGTCGGC GCCGTTACGT 

15701 GCCGGTGGAG GACATGAACG ATCATGCCAT TCGCGGCGAC ACCTTTGCCA 
CGGCCACCTC CTGTACTTGC TAGTACGGTA AGCGCCGCTG TGGAAACGGT 

15751 CACGGGCTGA GGAGAAGCGC GCTGAGGCCG AAGCAGCGGC C6AAGCTGCC 
GTGCCCGACT CCTCTTCGCG CGACTCCGGC TTC6TCGCCG GCTTCGACGG 

15801 GCCCCCGCTG CGCAACCCGA GGTCGAGAAG CCTCAGAAGA AACCGGTGAT 
CGGGGGCGAC 6CGTTGGGCT CCAGCTCTTC GGAGTCTTCT TTGGCCACTA 

15851 CAAACCCCTG ACAGAGGACA GCAAGAAACG CAGTTACAAC CTAATAAGCA 
GTTTGGGGAC TGTCTCCTGT CGTTCTTTGC GTCAATGTTG 6ATTATTCGT 

15901 ATGACAGCAC CTTCACCCAG TACCGCAGCT GGTACCTTGC ATACAACTAC 
TACTGTCGTG GAAGTGGGTC ATCGCGTCGA CCATGGAAC6 TAT6TTGATG 

15951 GGCGACCCTC A6ACCGGAAT CCGCTCATCG ACCCTGCTTT GCACTCCTGA 

CCGCTGGGAG TCTGGCCTTA GGC6AGTACC TGGGAC6AAA CGT6AG6ACT . 

160.01 CGTAACCTGC. GGC7CGGAGC AGGTCTAC7G 6TC6TTGCCA GACATGAT6C 
6CATT0QACG CCGAGCCTCG TCCAGATG;^C CAGCAACGGT CTGTACTACG 

16051 AAGACCCCGT GACCTTCCGC TCCACGCGCC AGATCAGCAA CTTTCCGGTG 
TTCT6GGGCA CTGGAAGGCG AGGT6CGCGG TCTAGTCGTT GAAAGGCCAC 
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16101 GTGOGCGCCG A^OTTGCC CGTGCACTCC AAGAGCTTCrTlCAAa^jlCA ' 
• CACCCGCGGC flRcAACQG GCACGTGAGG TTCTCGAAQA TGTTGcSBt 

16151 GGCCGTCTAC TCCCAACTCA TCCGCCAGTT TACCTCTC.TG ACCCACGTGT 
CCGCCAGATG AGGGTTGAGT AGGCGGTCAA ATGGAGAGAC TGGGTGCACA 

16201 TCAATCGCTT TCCCGAGAAC CAGATTTrGG C6CGCCCGCC AGCGCCCACC 
AGTTACCGAA AGGGCTCTTG GTCTAAAACC GCGCGGGCGG TCGGGGGTGG 

16251 ATCACCACCG TCAGTGAAAA CGTTCCTGCT CTCACAGATC ACGGGACGCT 
TAGTGGTGGC AGTCACTTTT GCAAGGACGA GAGTGTCTAG TGCCCTGCGA 

16301 ACCGCTGCGC AACAGCATCG GAGGAGTCCA GCGAGTGACC ATTACTGACG 
TGGCGACGCG TTCTCGTAGC CTCCTCAGGT CGCTCACTG6 TAATGACTGC 

16351 CCAGACGCCG CACCTGCCCC TACGTTTACA AGGCCCTGGG CATAGTCTCG 
GGTCTGCGGC GTGGACOGGG ATGCAAATGT TCCGGGACCC GTATCAGAGC 

16401 CCGCGCGTCC TATCGAGCCG CACTTTTTGA GCAAGCATGT CCATCCTTAT 
GGCGCGCAGG ATAGCTCGGC GTGAAAAACT CGTTCGTACA GGTAGGAATA 

16451 ATCGCCCAGC AATAACACAG GCTGGGGCCT GCGCTTCCCA AGCAAGATGT 
TAGCGGGTCG TTATTGTGTC CGACCCCGGA CGCGAAOGGT TCGTTCTACA 

16501 TTGGCGGGGC CAAGAAGCGC TCCGACCAAC ACCCAGTGCG CCTGCGCGGG 
AACCGCCCCG GTTCTTCGCG AG6CTGGTTG TGGGTCACGC GCACGCGCCC 

16551 CACTACCGCG CGCCCTGGGG CGCGCACAAA CGCGGCCGCA CTGGGCGCAC 
GTGATGGCGC GCGGGACCCC GCGCGTGTTT GCGCCGGCGT GACCCGCGTG 

16601 CACCGTCGAT GACGCCATCG ACGCGGTGGT GGAGGAGGCG CGCAACTACA 
GTGGCAGCTA CTGCGGrAGC TGCGCCACCA CCTCCTCCGC GCGTTGATGT 

16651 CGCCCACGCC GCCACCAGTG TCCACAGTGG ACGCGGCCAT TCAGACCGTG 
GCGGGTGCGG CGGTGGTCAC AGGTGTCACC TGCGCC6GTA AGTCTGGCAC 

16701 GTGCGCGGAG CCCGGCGCTA TGCTAAAATG AAGAGACGGC GGA6GCGCGT 
CACQCOCCTC GGGCCGCGAT ACGATTTTAC TTCTCTGCCG CCTCCGCGCA 

16751 AGCACGTCGC CACCGCCGCC GACCCGGCAC TGCCGCCCAA CGCGCGGCGG 
TCGTGCAGCG GTQGCGGCGG CTGGGCCGTG ACGGCGGGTT GCGCGCCGCC 

16BD1 CGGCCCTGCT TAACCGCGCA CGTCGCACCG GCCGACGGGC GGCCATGCGG 
6CCG6GACGA ATTGGCGCGT GCAGCGTGGC CGGCTGCCCG CCGGTACGCC 

16851 GCCGCTCGAA GGCTGGCCGC GGGTATTGTC ACTGTGCCTC CCAGGTCCAG 
COGCGAGCTT CCGACCGGCG CCCATAACAG TGACACGGGG GGTCCA6GTC 

16901 GC6ACGA6CG GCCGCCGCAG CAGCCGCX3GC CATTAGT6CT ATGACTCAGG 
CGCTGCTCGC CGGCGGCGTC GTCGGCGCCG GTAATCACGA TACTGAGTCC 

16951 CTCGCAGGGG CAACGTGTAT TGGGTGCGCG ACTCGGTTAG CGGCCTGCGC 
CAGCGTCCCC GTTGCACATA ACCCACGCGC TGAGCCAATC GCCGGACGCG 

17001 GTGCCCGTGC GCACCCGCCC CCCGCGCAAC TAGATTGCAA GAAAAAACTA 
CACOGGCACG CGTGG6CGGG GGGCGCX3TTC ATCTAACGTT CTTTTTTGAT 
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17051 CTTAGACTCG T^BGTTGTA TGTATCCAGC GCCGGCGWJG Ulit:AAL;|^y 
GAAICTGAGC AfflACAACAT ACATAGGTC6 CC6CCGCC0C QCGTTGCKC 

17101 CTATGTCCAA GCGCAAAATC AAAGAAC3AGA TGCTCCAGGT CATCGCGCCG 
GATACAGGTT CGCGTTTTAG TTTCTTCTCT ACGA6GTCCA GTAGCGCGGC 

17151 GAGATCTATG GCCCCCCGAA GAAGGAAGAG CAGGATTACA AGCCCCGAAA 
CTCTAGATAC CQGGGSGCTT CTTCCTTCTC GTCCTAATGT TCGGGGCTTT 



17201 GCTAAAGCGG GTCAAAAAGA AAAAGAAAGA TGATGATGAT GAACTTGACG 
CGATTTCGCC CAGTTTTTCT TTTTCTTTCT ACTACTACTA CTTGAACTGC 

17251 ACGAGGTOSA ACTGCTGCAC GCTACCGCGC CCAGGCGACG GGTACAGTGG 
TGCTCCACCT TQACSACGTG CGATGGCGCG GGTCCGCTGC CCATGTCACC 

17301 AAAGGTCQAC GCOTAAAACG TGTTTTGCGA CCCGGCACCA CCGTAGTCTT 
TTTCCAGCT6 CGCATTTTGC ACAAAAC6CT GGGCCGTGGT GGCATCAGAA 

17351 TACGCCCGGT GA6CGCTCCA CCCGCACCTA CAAGCGCGTG TATGATGAGG 
ATGCGGGCCA CTCGCGAG6T GGGCGTGGAT GTTCGCGCAC ATACTACTCC 

17401 TGTACGGCGA CGAGGACCTG CTTGAGCAGG CCAACGAGCG CCTCGGGGAG 
ACATGCCGCT GCTCCTGGAC GAACTCGTCC GGTTGCTCGC GGAGCCCCTC 

17451 TTTGCCTACG GAAAGCGGCA TAAGGACATG CTGGCGTTGC CGCTGGACGA 
AAACGGATGC CTTTCGCCGT ATTCCTGTAC GACCGCAACG GCGACCTGCT 

17501 GGGCAACCCA ACACCTAGCC TAAAGCCCGT AACACTGCAG CAGGTGCTGC 
CCCGTTGGGT TGTGGATCGG ATTTCGGGCA TTGTGACGTC GTCCACGACG 

""17551 CCGCGCTTGC ACCGTCCGAA "GAAAAGCGCG GCCTAAAGCG~CGAGTCTGGT- 
GGCGCGAACG TGGCAGGCTT CTTTTCGCGC CGGATTTCX GCTCAGACCA 

17601 GACTTGGCAC CCACCGTGCA GCTGATGGTA CCCAAGCGCC AGCGACTGGA 
CTGAACCGTG GGTGGCACGT CGACTACCAT 6GGTTCGCGG TCGCTGACCT 

17651 AGATGTCTTG GAAAAAATGA CCGTGGAACC TGGGCTGGAG CCCX3AGGTCC 
TCTACAGAAC CTTTTTTACT GGCACCTTGG ACCCGACCTC GGGCTCCAGG 

^7701 GCGTGCGGCC AATCAAGCAG GTGGCOCCGG SACTGGGCGT GCAGACCGTG 
CGCACGCCG6 TTAGTTCGTC CACCGCGGCC CTGACCCGCA CGTCTGGCAC 

-7751 GACGTTCAGA TACCCACTAC CAGTAGCACC AGTATTGCCA CCGCCACAGA 
CTGCAAGTCT ATOGGTGATG GTCATC6TG0 TCATAACGGT 6QC6GTGTCT 

17801 GGGCATGGAG ACACAAACGT CCCCGGTTGC CTCAGC6GTG GCG6ATGCCG 
CCCGTACCTC TGTGTTTCCA GGGGCCAACG GAGTCGCCAC CGCCTACGGC 

CGGTGCAG6C GGTCGCTGCG CCCGCCTCCA AGACCTCTAC GGAGGTGCAA 
GCCACGTCCG CCAGCGACOC C6GCGCAQGT TCTGGAGAT6 CCTCCACGTT 

ACGGACCC6T GGATGTTTCe COTTTCASCC CCCCOGCGCC CGCGCCGTTC 
TGCCTGGGCA CCTACAAAGC GCAAAGTCGG GGGGCCGCGG GCGCGGCAAG 

GAGGAAGTAC GGCGCCGCCA GCGCGCTACT GCCCGAATAT GCCCTACATC 
CTCCTTCATG CCGCGGCGGT CGCGCGATGA CGGGCTTATA CGGGATGTAG 
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18001 CTTCCATTGC GJXTACCCCC GGCTATCGTG GCTACACOTftSXCJCeC^jEA 
CAAGGTAACG d^kTGGGGG CCGATAGCAC CGATGTGGAT GGCGGQ^^T 

18051 AGACGAGCAA CTACCCGACG CCGAACCACC ACTGGAACCC GCCGCCGCCG 
TCTGCTCGTT GATGGGCTGC GGCTTGGTGG TGACCTTGGG CGGCGGCGGC 

18101 TCGCCGTCGC CA6CCCGTGC TGGCCXCGAT TTCCGTGCGC AGGGTGGCTC 
AGCGGCAGCG GTCGGGCACG ACCGGGGCTA AAGGCACGCG TCCCACCGAG 



18151 GCGAAGGAGG CAGGACCCTG GTOCTGCCAA CAGC6CGCTA CCACCCCAGC 
CGCTTCCTCC GTCCTGGGAC CACGACGGTT GTCGCGC6AT GGTGGGGTCG 

18201 ATCGTTTAAA AGCCGGTCTT TGTGGTTCTT GCAfiATATGG CCCTCACCTG 
TAGCAAATTT TCGGCCA6AA ACACCAAGAA CGTCTATACX: GGGAGTG6AC 

18251 CCGCCTCCGT TTCCCGGTGC CGGGATTCCG AGGAAGAATG CACCGTAGGA 
GGCGGAGGCA AACGGCCACG GCCCTAAGGC TCCTTCTTAC GTGGCATCCT 

18301 GGGGCAT6GC CGGCCACGGC CTGACG6GCG GCATGCGTCG TGCGCACCAC 
CCCCGTACCG GCCGGTGCCG GACTOCCCGC CGTACGCAGC ACGCGTG6TG 

18351 CGGCGGCGGC GCGCGTCGCA CCGTC6CATG CGCGGCGGTA TCCTGCCCCT 
GCCGCCGCCG C6CGCAGCGT GGCAGCGTAC GCGCCGCCAT AGGACGGGGA 

18401 CCTTATTCCA CTGATCGCCG CGGCGATTGG CGCCGTGCCC GGAATTGCAT 
GGAATAAGGT GACTAGCGGC GCCGCTAACC GCGGCACGGG CCTTAACGTA 

18451 CCGTGGCCTT GCAGGCGCAG AGACACTGAT TAAAAACAAG TTGCATGTGG 
GGCACCGGAA CGTCCGCGTC TCTGTGACTA ATTTTTGTTC aacgtacacc 

18501 AAAAATCAAA ATAAAAAGTC TGGACTCTCA CGCTCGCTTG GTCCTGTAAC 
TTTTTAGTTT TATTTTTCAG ACCTGAGAGT GCGA6CQAAC CAGGACATTG 

18551 TATTirGTAG AATGGAAGAC ATCAACTTTG CGTCTCTGGC CCCGCGACAC 
ATAAAACATC TTACCTTCTG TAGTTGAAAC GCAGAGACCG GGGCGCTGTG 

18601 GGCTCGCGCC CGTTCATGGG AAACTGGCAA GATATCGGCA CCAGCAATAT 
CCGAGCGCGG GCAAGTACCC TTTGACCGTT CTATAGCCGT GGTCGTTATA 

18651 GAGCGGTGGC GCCTTCAGCT GGGGCTCGCT GTGGAGCGGC ATTAAAAATT 
CTCGCCACCG CGGAAGTCGA CCCCGAGCGA CACCTCGCCG TAATTTTTAA 

18701 TCGGTTCCAC CGTTAAGAAC TATGGCTlGGA AGGCCTGGAA CAGCAGCACA 
AGCCAAGGTG GCAATTCTTG ATACCGTCGT TCCGGACCTT GTCGTCGTGT 

18751 GGCCAGATGC TGAGG6ATAA GTTGAAAGAG CAAAATTTCC AACAAAAGGT 
CCGGTCTACG ACTCCCTATT CAACTTTCTC GTTTTAAAGG TTGTTTTCCA 

18801 GGTAGATGGC CTGGCCTCTG GCATTAGCGG GGT6GTGGAC CTGGCCAACC 
CCATCTACCG GACCGGAGAC CGTAATCGCC CCACCACCTG 6ACCGGTTGG 

18851 AGGCAGTGCA AAATAAGATT AACAGTIAAGC TTGATCCCCG CCCTCCCGTA 
TCC6TCACGT TTTATTCTAA TTGTCATTCG AACTAGGGGC 6GGAGGGCAT 

18901 GAGGAGCCTC CACCGGCCGT GGAGACAGTG TCTCCAGAGG 6GCGTGGCGA 
CTCCTCGGAG GTGGCCGGCA CCTCTGTCAC AGAGGTCTCC CCGCACCGCT 
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18951 AAAGCGTCCG C^CCGACA GGGAAGAAAC TCTG6T6ACG"CAAATA|fcG 
TTTCGCAGGC GcRgGCTGT CCCTTCTTTG AGACCACTGC ffTTTAT^Ec 

19001 agcctccctc gtacgaggag gcactaaagc aaggcctgcc caccacccgt 
tcggagggag catgctcctc cgtgatttcg ttccggacgg gtggtgggca 

19051 cccatcgcgc ccatggctac cggagtgctg ggccagcaca cacccgtaac 
gggtagcgcg ggtaccgatg gcctcacgac ccggtcgtgt gtgggcattg 



19101 6CTGGACCTG CCTCCCCCCG CCGACACCCA GCAGAAACCT GTGCTGCCAG 

cgacctggac gsaggqgggc ggctgtgggt cgtctttgga cacgacggtc 

19151 gcccgaccgc cgttgttgta acccgtccta gccgcgcgtc cctgcgccgc 
cgggctggcg gcaacaacat tgggcaggat cggcgcgcag ggacgcggcg 

19201 gccgccagcg gtccgcgatc gttgcggccc gtagccagtg gcaactggca 
cggcggtcgc caggcgctag caacgccggg. catcggtcac cgttgaccgt 

19251 aagcacactg aacagcatcg tgggtctggg ggtgcaatcc ctgaagcgcc 
ttcgtgtgac ttgtcgtagc acccagaccc ccacgttagg gacttcgcgg 

19301 gacgatgctt ctgatagcta acgtgtcgta tgtgtgtcat gtatgcgtcc 
ctgctacgaa gactatcgat tgcacagcat acacacagta catacgcagg 

19351 atgtcgccgc cagaggagct gctgagccgc cgcgcgcccg ctttccaaga 
tacagcggcg gtctcctcga cgactcggcg gc6cgcg6gc gaaaggttct 

19401 tggctacccc ttcgatgatg ccgcagtggt cttacatgca catctcgggc 
accgatgggg aagctactac ggcgtcacca gaatgtacgt gtagagcccg 

19451 CAGGACGCCT"CGGAGTACCT GAGCCCCGG6 CTGGTGCAGT- TTGCCCGCGC 
GTCCTGCGGA GCCTCATGGA CTCGGGGCCC GACCACGTCA AACGGGCGCG 

19501 CACCGAGACG TACTTCAGCC TGAATAACAA GTTTAGAAAC CCCACGGTGG 
GTGGCTCTGC ATGAAGTCGG ACTTATTGTT CAAATCTTTG G6GTGCCACC 

19551 CGCCTACGCA CGACGTGACC ACAGACC6GT CCCAGCGTTT GACGCT6CGG 
GCGGATGCGT GCTGCACTGG TGTCTGGCCA GGGTCGCAAA CTGCGACQCC 

19501 TTCATCCCTG TG6ACCGTGA GGATACTGCG TACTCGTACA AGGCGCGGTT 
AAGTAGGGAC ACCTGGCACT CCTATGACGC ATGA6CAT6T TCCGCGCCAA 

19651 CACCCTAGCT GT6GGTGATA ACCGTGTGCT 6GACAT6GCT TCCACGTACT 
GTGGGATCGA CACCCACTAT TGGCACACGA CCTGTACCGA AGGTGCATGA 

19701 TTGACATCCG CG6CGTCCTC GACAGGGGCC CTACTTTTAA GCCCTACTCT 
AACTGTAGGC GCCOCACGAC CTGTCCCCGG GAT6AAAATT CGGGATGAGA 

19751 GGCACTGCCT ACAACOCCCT G6CTCCCAAG GGTGCCCCAA ATCCTTGCGA 
CCGTGACGGA TGTTGCGGGA CCGAGG6TTC CCAC6G6GTT TAGGAACGCT 

19BC1 ATGGGATGAA CCTGCTACTG CTCTTGAAAT AAACCTAGAA GAAGAGGACG 
TACCCTACTT CGAC6ATGAC GAGAACTTTA TTTGGATCTT CTTCTCCTGC 

19851 ATGACAACGA AGACGAAGTA 6ACGAGCAAG CTGAGCAGCA AAAAACTCAC 
TACTGTT6CT TCTCCTTCAT CTGCTCCTTC GACTCGTCGT TTTTTGAGTG 
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19901 . GTATTTCGGC J^^pOCCTTA TTCTGGTATA AATATTACAA AGGAGQMT 
»TAAACCCG ^^GCGGAAT AAGACXIATAT TTATAATGTO TCCTCCCTTA 

19951 TCAAATAGGT GTCGAAGGTC AAACACXJTAA ATATGCCGAT AAAACATTTC 
AGTTTATCCA CACCTTCCAG TTTGTGGATT TATACGGCTA TTTTGTAAA6 

20001 AACCTGAACC TCAAATAGGA GAATCTCAGT GGTACGAAAC AGAAATTAAT 
TTGGACTTGG AGTTTATCCT CTTA6AGTCA CX:ATGCTTTG TCTTTAATTA 

20051 CATGCAGCTG 06AGAGTCCT AAAAAAGACT ACCCCAATGA AACCATGTTA 
GTACGTC6AC OCTCTCAGGA TTTTTTCTGA T6GGGTTACT TTGCTACAAT 

20101 CGGTTCATAT GCAAAACCCA CAAATGAAAA T6GAGG6CAA GGCATTCTTG 
GCCAAGTATA CGTTTT6GGT GTTTACTTTr ACCTCCCGTT CCGTAAGAAC 

20151 TAAAGCAACA AAATGGAAAG CTAGAAAGTC AAGTGGAAAT GCAATTTTTC 
ATTTCGTTGT TTTACCTTTC 6ATCTTTCAC TTCACCTTTA CCTTAAAAAC 

20201 TCAACTACTG AGGCAGCCGC AGGCAATGGT GATAACTTGA CTCCTAAAGT 
AGTTGATGAC TCCQTCGQCG TCCGTTACCA CTATTGAACT GAGGATTTCA 

20251 GGTATTGTAC AGTGAAGATG TAGATATA6A AACCCCAGAC ACTCATATTT 
CCA^AACATG TCACTTCTAC ATCTATATCT TTGGGGTCTG TGAGTATAAA 

20301 CTTACATGCC CACTATTAAG GAAGGTAACT CACGAGAACT AATGGGCCAA 
GAATGTACG6 GTGATAATTC CTTCCATTGA GTGCTCTTGA TTACCCGGTT 

20351 CAATCTATGC CCAACAGGCC TAATTACATT GCTTTTAGGG ACAATTTTAT 
GTTAGATACG GGTTGTCC6G ATTAATGTAA CGAAAATCCC TGTTAAAATA 

20401 TGGTCTAATG TATTACAACA GCACGGGTAA TATGGGTGTT CTGGCGGGCC 
ACCAGATTAC ATAATGTTGT CGTGCCCATT ATACCCACAA GACCGCCCGG 

20451 AAGCATCGCA GTTGAATGCT GTTGTAGATT TGCAAGACAG AAACACAGAG 
TTC6TAGCGT CAACTTACGA CAACATCTAA ACGTTCTGTC CTTGTGTCTC 

20501 CTTTCATACC A6CTTTTGCT TGATTCCATT GGTGATAGAA CCAGGTACTT 
GAAAGTATGG TCGAAAACGA ACTAAGGPAA CCACTATCTT GGTCCATGAA 

20551 TTCTATGTGG AATCAGGCTG TTGACAGCTA TGATCCAGAT GTTAGAATTA 
AAGATACACC TTAGTCCGAC AACTGTCGAT ACTAGGTCTA CAATCTTAAT 

20601 TTGAAAATCA TGGAACTGAA GATGAACTTC CAAATTACTG CTTTCCACTG 
AACTTTTAGT ACCTTGACTT CTACTTGAAG GTTTAAT6AC GAAAGGTGAC 

20651 GGAGGTGTGA TTAATACAGA GACTCTTACC AAGGTAAAAC CTAAAACAGG 
CCTCCACACT AATTAT6TCT CTGftGAATQO TTCCATTTTG 6ATTTTGTCC 

20701 TCAGGAAAAT GGATGGGAAA AAGA7GCTAC AGAATTTTCA GATAAAAATG 
AGTCCTTTTA CCTaCCCTTT TTCTACGATG TCTTAAAAGT CTATTTTTAC 

20751 AAATAAGA6T TGGAAATAAT TTTGCCATOG AAATCAATCT AAATGCCAAC 
TTTATTCTCA ACCTTTATTA AAACGGTACC TTTAGTTAGA TTTACGGTTG 

20801 CTGTGGAGAA ATTTCCTGTA CTCCAACATA GCGCTQTATT TGCCCGACAA 
GACACCTCTT CAAAGGACAT CAGGTTGTAT CGC6ACATAA ACGGGCTGTT 
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20851 GCTAAAGTAC J^CTTCCh ACGTAAAAAT TTC'TGhT^^'^CJ^ii^^l^^T 
CGATTTCATG ^IgAAGGT T6CATTTTTA AAGACTATTG GGTTTG^A 

20901 ACGACTACAT GAACAAGC6A GTGGTGGCTC CCGGGCTAGT GGACTGCTAC 
TGCTGATGTA CTTGTTC6CT CACCACCGAG QGCCCGATCA CCTGACGATG 

20951 ATTAACCTTG 6AGCACGCTG GTCCCrXGAC TATATGGACA ACGTCAACCC 
TAATTGGAAC CTC6TGC6AC CAGGGAACTG ATATACCTGT TGCAGTTGGG 

21001 ATTTAACCAC CACCGCAATG CTGGCCTGCG CTACCGCTCA ATGTTGCTGG 
TAAATTGGTG GTGGCGTTAC GACCGGACGC GATGGCGAGT TACAACGACC 

21051 GCAATGGTCG CTATGTGCCC TTCCACATCC AGGTGCCTCA GAAGTTCTTT 
CGTTACCAGC GATACACGGG AAGGTGTAGG TCCACGGAGT CTTCAAGAAA 

21101 6CCATTAAAA ACCTCCTTCT CCTGCCGGGC TCATACACCT ACGAGTGGAA 
CGGTAATTTT TGGAGGAAGA GGACGGCCCG AGTATGTG6A TGCTCACCTT 

21151 CTTCAGGAAG GATGTTAACA TGGTTCTGCA GAGCTCCCTA 6GAAATGACC 
GAAGTCCTTC CTACAATTGT ACCAAGAC6T CTCGAGGGAT CCTTTACTGG 

21201 TAAGGGTTGA CGGAGCCAGC ATTAAGTTTG ATAGCATTTG CCTTTACGCC 
ATTCCCAACT GCCTCGGTCG TAATTCAAAC TATCGTAAAC G(3AAATGCGG 

21251 ACCTTCTTCC CCATGGCCCA CyiACACCCCC TCCACGCTTG AQGCCATGCT 
TGGAAGAAGG GGTACCG60T GTTGT6GCGG AGGTGCGAAC TCCGGTACGA 

21301 TA6AAAC6AC ACCAACGACC AGTCCTTTAA CGACTATCTC TCCGCCGCCA 
ATCTTTGCTG TGGTTGCTGG TCAGGAAATT QCTGATAGAG AGGCGGCGGT 

21351 ACATGCTCTA CCCTATACCC GCCAACGCTA CCAACGTGCC CATATCCATC 
TCTACGAGAT-GGGATATGGG-CGGTTCCGAT„GG!I!TGjy^G ^ _ 

21401 CCCTCCCGCA ACTCGGCGGC TTTCCGCGGC TGGGCCTTCA CGCGCCTTAA 
GGGAGGGCGT TGACCCGCCG AAAGGC6CCG ACCCGGAAGT QCGCGGAATT 

21451 GACTAAGGAA ACCCCATCAC TGGGCTCGGG CTACGACCCT TATTACACCT 
CTGATTCCTT TGQGGTAGTG ACCCGAGCCC GATGCTGGGA ATAATGTGGA 

21501 ACTCTGGCTC TATACCCTAC CTAGATGGAA CCTTTTACCT CAACCACACC 
TGAGACCGAG ATATGGGATG GATCTACCTT GGAAAATGGA GTTGGTGTGG 

21551 TTTAAGAAGG TGGCCATTAC CTTTGACTCT TCTGTCAGCT GGCCTGGCAA 
AAATTCTTCX: ACCGGTAATG GAAACTGA6A AGACAGTCGA CCGGACCGTT 

21601 TGACCGCCTC CTTACCCCCA ACGAGTTTGA AATTAAGCGC TCAGTTGACG 
ACTGGCGGAC GAATGGGGGT TGCTCAAACT TTAATTCGCG AGTCAACTGC 

21651 GGGAGGGTTA CAACGTTGCC CAGTGTAACA TGACCAAAGA CTQGfTTCCTG 
CCCTCCCAAT GTTGCAACGG GTCACATTGT ACTGGTTTCT GACCAAGGAC 

21701 GTACAAATGC TAGCTAACTA TAACATTGGC TACCAG6GCT TCTATATCCC 
CATGTTTACG ATCGATTGAT ATTGTAACCG ATGGTCCCX» AGATATA6QG 



21751 



AGAGAGCTAC AAGGACCGCA TGTACTCCTT CTTTA6AAAC TTCCAGCCCA 
TCTCTCGATG TTCCTGGCGT ACATGAGGAA GAAATCTTTG AAGGTCGGGT 
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21801 TGAGCCGTCA GMGGTGGAT GATACTAAAT ACAAGGAD!rA«^(S^AC?^eg»G 
ACTCCCCAGT cUcCACCTA CTATGATTTA TGTTCCTGAT GGTTGl^Lc 

218S1 GGCATCCTAC ACCAACACAA CAACTCTGGA TTTGTTGGCT ACCTTGCCCC 
CCGTAGGATG TOGTTGTGTT GTTGAGACCT AAACAACC6A TGGAACGGGG 

21901 CACCATGCGC GAAGGACAGG CCTACCCTGC TAACTTCCCC TATCCGCTTA 
GTGGTACGCG CTTCCTGTCC GGATGGGACG ATTGAAGGGG ATAGGCGAAT 

21951 TAGGCAAGAC CGCAGTTGAC ACCATTACCC AGAAAAAGTT TCTTTGCGAT 
ATCC6TTCTG GCGTCAACTG TCGTAATGGG TCTTTTTCAA AGAAAC:GCTA 

22001 cgcacccttt ggcgcatccc attctccagt aactttatgt ccatgggcgc 
gcgtgggaaa ccgcgtaggg taagaggtca ttgaaataca ggtacccgcg 

22051 actcacagac ct6ggccaaa accttctcta cgccaactcc gcccacgcgc 
TGAGTGTCTG gacccggttt tggaagagat gcogttqagg cgggtocgcg 

22101 tagacatgac ttttgaggtg gatcccatog acgagcccac ccttctttat 
atctgtactg aaaactccac ctagggtacc tgctcgggtg ggaa6aaata 

22151 gttttgtttg aagtctttga c6tggtccgt gtgcaccagc cgcaccgcgg 
caaaacaaac ttcagaaact gcaccaggca cacgtggtcg gcgtgiscgcc 

22201 cgtcatcgaa accgtgtacc tgcgcacgcc cttctcggcc ggcaacgcca 
gcagtagctt tg6cacatgg acccgtccgg gaagagccgg ccgttgcggt 

22251 caacataaa6 aagcaagcaa catcaacaac a6ctgccgcc atgggctcca 
gttgtatttc ttcgttcgtt gtagt7gttg tcgacggcgg tacccgaggt 

22301 gtgagcag6a act6aaagcc attgtcaaag atcttggttg t6ggccatat 
cactcgtcct tgactttcgg taacagtttc tagaaccaac accc6gtata 

22351 tttttgggca cctatgacaa gcqctttcca cgctttcttt ctccacacaa 
aaaaaccc6t ggatactgtt cgcgaaaqgt ccgaaacaaa gaogtgtgtt 

22401 gctcgcctgc gccatagtca atacggccgg tcgcgagact gggggcgtac 

CGA6CGGACG CGGTATCAGT TATX3CCGQCC AGCGCTCTGA CCCCCGCATG 

22451 ACTGGATGGC CTTTGCCTGG AACCCGCACT CAAAAACATG CTACCTCTTT 
TGACCTACCG GAAACGGACC TTGGGCGTGA GTTTTTGTAC GATGGAGAAA 

22501 GAGCCCTTTG GCTTTTCTGA CCAGCGACTC AAGCAGGTTT ACCAGTTTGA 
CTCGGGAAAC C6AAAAGACT GGTCGCTGAG TTCGTCCAAA TGGTCAAACT 

22551 GTACGAGTCA CTCCTGCGCC GTAGCGCCAT TGCTTCTTCC CCCGACCGCT 
CATOCTCAGT GAGGACGCG6 CATCGCGGTA ACGAAGAAGG GGGCTGGCGA 

22601 GTATAACGCT GGAAAAGTCC ACCCAAAGCG TACAGGGGCC CAACTCGGCC 
CATATTGCGA CCTTTTCAGG TGGGTTTCGC ATCTCCCCCG GTTGAGCCG6 

■ 

22651 GCCTGTGGAC TATTCTGCTG CATGTTTCTC CACGCCTTTG CCAACTGGCC 
C6GACACCTG ATAAGACGAC GTACAAAGAG GTGC6GAAAC GGTTGACC6G 

22701 CCAAACTCCC ATGGATCACA ACCCCAC'CAT GAACCTTATT ACCGGGGTAC 
GGTTTGAGGG TACCTAGTGT TOGGGTGGTA CTTGGAATAA TGGCCCCATG 
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22751 
22801 
22851 
22901 

22951 

23001 

23051 

23101 

2315a 

23201 

23251 

23301 

23351 

23401 

23451 

23501 
* 

23551 
23601 
23651 



CCAACTCCAT GCTCAACAGT CCCCAOGTIAC AGCCCACq^jPf€J 



CAGGAACAGC TCTACAGCTT CCTGGAGCGC CACTCGCCCT ACTTCCGCAG 
GTCCTTGTCG AGAT6TCGAA GGACCTCGC6 GTGAGCGGGA TGAAGGCGTC 

CCACAGTGCG CAGATTAGGA GCGCCACTTC TTTTTGTCAC TTGAAAAACA 
GGTGTCACGC GTCTAATCCT CGCGGTGAAG AAAAACAGTG AACTTTTTGT 

TGTAAAAATA ATGTACTAGA GACACrTTCA ATAAAGGCAA ATGCTTTTAT 
ACATTTTTAT TACATGATCT CTGTGAAAGT TATTTCCGTT TACGAAAATA 



TTGTACACTC TCGGGTCATT ATTTACCCCC ACCCTTGCCG TCTGCGCCGT 
AACATGTGAG AGCCCACTAA TAAATGGGGG TGGGAACGGC AGACGC6GCA 

TTAAAAATCA AAGGGGTTCT GCCGCGCATC GCTATGCGCC ACTGGCAG6G 
AATTTTTAGT TTCCCCAAGA CGGCGCGTAG CGATACGCGG TGACCGTCCC 

ACACGTTGCG ATACTGGTGT TTAGTGCTCC ACTTAAACTC AGGCACAACC 
TGTGCAACGC TATGACCACA AATCACGAGG TGAATTTGAG TCCGTGTTGG 

ATCCGCGGCA GCTCGGTGAA GTTTTCACTC CACAGGCTGC GCACCATCAC 
TAGGCGCCGT CGAGCCACTT CAAAAGTGAG GTGTCCGAC6 CGTGGTAGTG 

CAACGCGTTT AGCAGGTCGG GCGCCGATAT CTTGAAGTCG CAGTTGGGGC 
GTTGCGCAAA TCGTCCAGCC CGCGGCTATA GAACTTCAGC GTCAACCCCG 

CTCCGCCCTG CGCGCGCGAG TTGCGATACA CAGGGTT6CA GCACTGGAAC 
GAGGCGGGAC GCGCGCGCTC AACGCTATGT GTCCCAACGT CGT6ACCTTG 

ACTATCAGCG CCGGGT6GTG CAC6CTGGCC AGCACGCTCT TGTCGGAGAT 
TGATAGTCGC GGCCCACCAC GTGCGACCGG TCGTGCGAGA-ACAGCCTCTA 

CAGATCCGCG TCCAGGTCCT CCGCGTTGCT CAGGGCGAAC GGAGTCAACT 
GTCTAGGCGC AGGTCCAGGA GGCGCAACGA GTCCCGCTT6 CCTCAGTTGA 

TTGGTAGCTG CCTTCCCAAA AAGGGCGCGT GCCX:AGGCTT TGAGTTGCAC 
AACCATCGAC GGAAGGGTTT TTCCCGCGCA CGGGTCCGAA ACTCAACGTG 

TC6CACCGTA GTGGCATCAA AAGGTGACCG TGCCCGGTCT GGGCGTTAGG 
AGCGTGGCAT CACCGTAGTT TTCCACTGGC ACQGOCCAGA CCCGCAATCC 

ATACAGCGCC TGCATAAAAG CCTTGATCTG CTTAAAAGCC ACCTGAGCCT 
TATGTCGCGG ACQTATTTTC GGAAC?A6AC GAATTTTCGG TGGACTCGGA 

TTGCGCCTTC ASAGAAGAAC ATGCXGCAAG ACTTGCCGGA AAACTGATTG 
AACOCGGAAG TCTCTTCTT6 TAOGGCGTTC TGAACGQCCT TTTCACTAAC 

GCCG6ACAGG CCGCGTCGTG CACGCAGCAC CTTGCGTCGG TGTTGGAGAT 
CCGCCTGTCC G6CGCAGCAC GTGCGTCGTG GAACGCAGCC ACAACCTCTA 

CTGCACCACA TTTCGGCCCC ACCGGTTCTT CACGATCTTG GCCTTGCTAG 
GACGTOGTGT AAAGCCQGQG TGGCCAAGAA GTGCTAGAAC CGGAACGATC 

ACTGCTCCTT CAGCGCGCGC TGCCCGTTTT CGCTCGTCAC ATCCATTTCA 
TGACCAGGAA CTCGCGCGCG ACGGGCAAAA GCGAGCAGTG TAGGTAAAGT 
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23701 ATCACGTGCT dWATTTAT CATAATSCTT CCGTGTAGAC ACTTAaIRc 
TA6TGCACGA GGAATAAATA GTATTACGAA GGCACATCTG TCAATTCGAG 

23751 GCCTTCGATC TCAGCGCAGC GGTGCAGCCA CAAC6CGCAG CCC6TGGGCT 
CGGAAGCTAG AGTCGCGTCG CCACGTCGGT GTTGCGCGTC GGGCACCCGA 

23801 CGTGATGCTT GTAGGTCACC TCTGCAAACG ACTGCAGGTA CGCCTGCAGG 
GCACTACGAA CATCCAGTGG AGACGCTTGC TGACGTCCAT GCGGACGTCC 

23851 AATCGCCCCA TCATCGTCAC AAAGGTCTTG TTGCTGGTGA AGGTCAGCTG 
TTAGCGGGGT AGTAGCAGTG TTTCCAGAAC AACGACCACT TCCAGTCGAC 



23901 CAACCCGCGG TGCTCCTCGT TCAGCCAGGT CTTGCATACG GCCGCCAGAG 
GTTGGGCGCC ACGAGGAGCA AGTCGGTCCA GAACGTATGC CGGCGGTCTC 

23951 CTTCCACTTG GTCAGGCAGT AGTTTGAAGT TCGCCTTTAG ATCGTTATCC 
GAAGGTGAAC CAGTCXIGTCA TCAAACTTCA AGCGGAAATC TAGCAATAGG 

24001 ACGTGGTACT TCTCCATCAG CGCGCGCGCA GCCTCCATGC CCTTCTCCCA 
TGCACCATGA ACAGGTAGTC GCGCGCGCGT CG6AGGTACG GGAAGAGGGT 

24051 CGCAGACACG ATCGGCACAC TCAGCGGGTT CATCACCGTA ATTTCACTTT 
GCGTCTCTGC TAGCCGTGTG AGTCGCCCAA GTAGTGGCAT TAAAGTGAAA 

24101 CCGCTTCGCT GGGCTCTTCC TCTTCCTCTT GCGTCCGCAT ACCACGCGCC 
GGCGAAGCGA CCCGAGAAGG AGAAGGAGAA CGCAGGC6TA TGGTGCGCGG 

24151 ACTGGGTCGT CTTCATTCAG CCGCCGCACT GTGCGCTTAC CTCCTTTGCC 
TGACCCAGCA GAAGTAAGTC GGCGGCGTGA CACGCGAATG GAGGAAAC6G 

24201 ATGCTTGATT AGCACCGGTG GGTTGCTGAA ACCCACCATT TGTAGCGCCA 
TACGAACTAA TCGTG6CCAC CCAACGACTT TGGGTGGTAA ACATC6CGGT 

24251 CATCTTCTCT TTCTTCCTCG CTGTCCACGA TTACCTCTO6 TGATGGCGGG 
GTAGAAGAGA AA6AAGGAGC GACAGGTGCT AATGGAGACC ACTACCGCCC 

24301 CCCTCGGGCT TGGGAGAAGG GCGCTTCTTT TTCTTCTTGG GCGCAATGGC 
GCGAGCCCGA ACCCTCTTCC CGCGAAGAAA AACAAGAACC CQCGTTACCG 

24351 CAAATCCGCC GCCGAGGTCG ATGGCCGCX3G GCTGGGTGTG CGCGGCACCA 
6TTTAGGCGG CGGCTCCAGC TACCGGCGCC CGACCCACAC GCGCCGTGGT 

24401 GCCCGTCTTG TGATGAGTCT TCCTCGTCCT CGGACTCGAT ACGCCGCCTC 
CGCGCAGAAC ACTACTCAGA AGGA6CAGGA GCCTGAGCTA TGCGGCGGAG 

24451 ATCCGCTTTT OTGGOGGCGC CCGGGGAGGC GGCGGCGACG GGGACGGGGA 
TACGCGAAAA AACCCCCGCG GGCCCCTCCG CCGCCGCTGC OCCTGCCCCT 

24501 CGACACGTCC TCCATGGTTG GQGGACGTCG CGCCGCACCG CGTCCGCGCT 
GCTGTGCAGG AGGTACCAAC CCCCTGCA6C GCGGCGTGGC GCAGGCGCGA 

24551 C6GGGGTGGT TTCGCGCTGC TCCTCTTCCC GACTGGCCAT TTCCTTCTCC 
6CCCCCACCA AACCGCGACG AGGAGAAGG6 CTGACCGGTA AAGGAAGAGG 

24^01 TATAGGCAGA AAAAGATCAT GGAGTCAGTC GAGAAGAAGG ACAGCCTAAC 
ATATCCGTCT TTTTCTAGTA CCTCAGTCAG CTCTTCTTCC TGTCGGATTG 
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24651 CGCCCCCTCT q^'TCGCCA CCACCGCCTC CACCGATCjCC CJL'CAAq^BC 



GCGGGGGAGA CTCSAGCGGT GGTGGCGGAG GTGGCTACGG CGGTTG(51CG 

247 OL CTACCACCTT CCCCGTCGAG GCACCCCCGC TTGAGGAGGA GGAAGTGATT 
GATGGTGGAA GGGGCAGCTC CGTGG6GGC6 AACTCCTCCT CCTTCACTAA 

2475L ATCGAGCAGG ACCCAGGTTT TGTAAGCGAA GACGACGAGG ACCGCTCAGT 
TAGCTCGTCC TGGGTCCAAA ACATTCGCTT CTGCTGCTCC TG6CGAGTCA 

24801 ACCAACAGAG GATAAAAAGC AAGACCAGGA CAACGCAGAG GCAAACGAGG 
TGGTTGTCTC CTATTTTTCG TTCTGGTCCT GTTGC6TCTC CGTTTGCTCC 



24851 AACAAGTCGG GCGGGGGGAC GAAAGGCATG GCGACTACCT AGATGTGGGA 
TTGTTCAGCC CX5CCCCCCTG CTTTCCGTAC CGCTGATGGA TCTACACCCT 

24901 GACGACGTGC TGTTGAAGCA TCTGCAGCGC CAGTGCGCCA TTATCTGCGA 
CTGCTGCACG ACAACTTCGT AGACGTCGCQ GTCACGCGGT AATAGACGCT 

24951 CGCGTTGCAA GAGCGCAGCG ATGTGCCCCT CGCCATAGCG GATGTCAGCC 
GCGCAACGTT CTCGCGTCGC TACACGGGGA GCGGTATCGC CTACA6TCGG 

25001 TTGCCTACGA ACGCCACCTA TTCTCACC6C 6CGTACCCCC CAAACGCCAA 
AACGGATGCT TGCGGTGGAT AAGAGTCGCG C6CATGGGGG GTTT6CGGTT 

25051 GAAAACGGCA CATGCGA6CC CAACCCGCGC CTCAACTTCT ACCCCGTATT 
CTTTTGCCGT GTACGCTCGG GTTGGGCGCG GAGTTGAAGA TGGGGCATAA 

25101 TGCCGTGCCA GAGGTGCTTG CCACCTATCA CATCTTTTTC CAAAACTGCA 
ACGGCACGGT CTCCACGAAC GGTGGATAGT 6TAGAAAAAG GTTTTGACGT 

"25151 AGATACCCCT ATCCTGCCGT-GCCAACCGCA- GCCGA6CGGA CAAGCAGCTG-^ 
TCTATGGGGA TAGGACGGCA CCGTTGGCGT CGGCTCGCCT GTTC6TCGAC 

25201 GCCTTGCG6C AGGGCGCT6T CATACCTGAT ATC6CCTCGC TCAAC6AAGT 
CGGAACGCCG TCCCGC6ACA GTATGGACTA TAGCGGAGCG AGTT6CTTCA 

25251 GCCAAAAATC TTTGAGGGTC TTGGACGCGA CGAfiAAGCGC GCGGCAAACG 
CG6TTTTTAG AAACTCCCAG AACCTGCGCT OCTCTTCGCG CGCCGTTTGC 

25301 CTCTGCAACA GGAAAACAGC GAAAATGAAA GTCACTCTGG AGTGTTGGTG 
GA6ACGTTGT CCTTTTGTCG CTTTTACTTT CAGTGAGACC TCACAACCAC 

25351 GAACTCGA6G GTGACAACGC GCGCCTAGC'c GTACTAAAAC GCAGCATCGA 
CTTGAGCTCC CACTGTTGCG CG0Q6ATCGG CATGATTTTG CGTCGTAGCT 

25401 GGICACCCAC TTTGCCTACC CGGCACTTAA CCTACCCCCC AAGGTCATGA 
CCAGTGGGTG AAACGGATGG GCCGTGAATT GGATGGG6GG TXCCAGTACT 

25451 GCACAGTCAT GAGTGAGCTG ATCGTGCGCC GTGCGCAGCC CCTGGAGAGG 
C6TGTCAGTA CTCACTC6AC TAGCACGCGG CACGCGOCGG GGACCTCTCC 

25501 6ATGCAAATT TGCAAGAACA AACAGAGGAG GGCCTACCCG CAGTTGGCGA 
CTACGTTTAA ACGTTCTTGT TTGTCTCCTC CCGGATGGGC 6TCAACCGCT 

25551 CGAGCAGCTA GCQCGCTGGC TTCAAACGCG CGAGCCTGCC GACTTGGAGG 
GCTCGTC6AT CGCGCGACCG AAGTTTGCGC GCTCGGACGG CTGAACCTCC 
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25601 AGCGACGCAA ^^P^ATGATG GCCGCAGTGC TCGTTACCGT GGAOCI^^G 
TCGCTGCGTT TGATTACTAC CGGCGTCACG AGCAATOGCA CCTCQAACTC 

25651 TGCATGCAGC GGTTCTTTGC TGACCCG6AG ATGCACCGCA AGCTAGAOGA 
ACGTACGTCG CCAAGAAACG" ACTGGGCCTC TACGTCGCGT TCGATCTCCT 

# 

25701 AACATTGCAC TACACCTTTC GACAGGGCTA CGTACGCCAG GCCTGCAAGA 
TTGTAACGTG ATGTGGAAAG CTCTCCCGAT GCATGCGGTC CGCSACCTTCT 

25751 TCTCCAACGT GGAGCTCT6C AACCTGGTCT CCTACCTTGG AATTTTGCAC 
AGA6GTTGCA CCTCGAGACG TTGGACCA6A GGATGGAACC TTj^AAACGTG 



25801 GAAAACCGCC TTGGGCAAAA CGTGCTTCAT TCCACGCTCA AGQOCGAGGC 
CTTTTGGC66 AACCCGTTTT GCACGAAGTA AGCTGCGAGT TCCCGCTCCG 

25851 GCGCCGCGAC TACGTCCGCG ACTQCGTTXA CTTATTTCTA TQCTACACCT 
CGCGGCGCTG ATGCAGGCGC TGACGCAAAT 6AATAAAGAT ACGATGTGGA 

25901 GOCAGACGGC CATGGGCGTT TGGCAGCAGT GCTTGGAGGA GTGCAACCTC 
CCGTCT6CCG GTACCC6CAA ACCGTCGTCA CGAACCTCCT CACGTTGCAG 

25951 AAGGAGCTCC AGAAACTGCT AAAGCAAAAC TTGAAGGACC TATGGACGGC 
TTCCTCQACQ TCTTTGACGA TTTCG?TTTG AACTTCCTGG ATACCTCCCG 

26001 CTTCAACGAG CGCTCCGTGG CCGCGCACCT GGCGGACATC ATTTTCCCCG 
GAAGTTGCTC GCGAGGCACC GGCGC6TGGA CCGCCTGTAG TAAAAGGGGC 

26051 AACGCCTGCT TAAAACCCTG CAACAGGGTC TGCCAGACTT CACCAGTCAA 
TTGCGGACGA ATTTTGGGAC GTTGTCCCAG ACGGTCTGAA GTGGTCAGTT 

26101 AGCATGTTGC AGAftCTTTAG GAACTTTATC CTAGAGCGCT CAGGAATCTT 
TCCTACAACG TCTTGAAATC CTTGAAATAG 6ATCTCGCGA GTCCTTAGAA 

26151 GCCCGCCACC TGCTGTGCAC TTCCTAGCGA CTTTGTGCCC ATTAAGTACC 
CGGGCGGTGG ACGACACGTG AAGGATCGCT GAAACACGGG TAATTCATGG 

26201 GCGAATGCCC TCCGCCGCTT TGGGGCCACT GCTACCTTCT GCAGCTAGCC 
CGCTTACGGG AGGCGGCGAA ACCCCGGTGA CGATGGAAGA CGTCGATCGG 

26251 AACTACCTTG CCTACCACTC TGACATAATG GAAGACGTGA GCGGTGACGG 
TTGATGGAAC GGATGGTGAG ACTGTATTAC CTTCTGCACT CGCCACTGCC 

26301 TCTACTGGAG TGTCACTGTC GCTGCAACCT ATGCACCCCG CACCGCTCCC 
AGAT6ACCTC ACA6TGACAG CGACGTTGGA TACGTGGGGC GTGGCGAGGG 

26351 TGGTTTGCAA TTCGCAGCTG CTTAACGAAA GTCAAATTAT CG6TACCTTT 
ACCAAACGTT AAGCGTCGAC GAATTGCTTT CAGTTTAATA GCCATGGAAA 

26401 GAGCTGCAGG GTCCCTCGCC T6ACGAAAAG TCCGCGGCTC CGGGCSTTGAA 
CrCGACGTCC CAGGGAGCGG ACTGCTTTTC AOGCGCCGAG GCCCCAACTT 

26451 ACTCACTCCG GGGCT6TGGA CGTCQ3CTTA OCTTCGCAAA TTTGTACrTG 
TGAGTGAGGC CCCGACACCT GCAGCCGAAT G6AAGCGTTT AAACATGGAC 

26501 AGGACTACCA CGCCCAC6AG ATTASGrTTCT ACGAAGACCA ATCCCGCCCG 
TCCTGATGGT GCGGCTCCTC TAATCCAAGA T6CTTCTG6T TAGGGCGGGC 
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26551 CCTAATGCSG ri»TACCGC CTCCGTCATT ACCCAGGGCC ACATTd^G 
GSATTAC6CC OCGAATGGCG GftCOCWGTAA TOGOTCCCCC TGTAAGAACC 

26601 CCAATTGCAA SCCATCAACA AAGCCCGCCA AGAGTrrCTG CTACGAAA6G 
GGTTAACGTT CGGTAQTTOT TTCGGQCGGT TCTCAAAG&C GATGCTTTCC 

26651 GACGGGGGGT TOACTTGGAC CCCCAGTCCG GCGAGGAGCT CAACCCAATC 
CTGCCCCCCA AATQAACCTG GQGQK»GGC CGCTCCTC6A GTTGGGTTAG 

26701 CCCCCGCCGC CGCAGCCCTA TCA6CAGCAG CCGCGGGCCC TTGCTTCCCA 
GG66GCQGCG 0C6TCGGGAT ASTCGTCGTC GGCGCCCGSG AACGAAGGGT 

26751 GGATCGCACC CAAAftAGAAG CTGCAGCTSC CGCCGCCACC CACGGACGAG 
CCTACCGTOG GrmTCTTC GSCGTCGACG GCGCCGGTGG GTGCCTGCTC 

56001 GAGGAATACT GGGACAGTCA GGCAGftGGAG GTTTTGGACG AGGA0GA6SA 

CCCroXCAGX CCGTCTCCTC CAAAACCTGC TCCTCCTCCT 

26851 GGACATCATC GAAGRCTGGG AGAGCCTAGA CGAGGAAGCT TCCGAGGTCG 
CCTCTACTAC CTTCTGACCC TCTCGGATCT GCTCCTTCQA AGGCTCCAGC 

26901 AAGAGGTCTC AGACGAARCA CCGTCACCCT CGGTCGCATT CCCCT^G 
TTCTCCACAG TCTGCTTTGT OGCAGTGSGA GCCAGCGTAA GGGGAGCGGC 

26951 GCGCCCCAGA AATCGGCAAC CGGTTCCAGC ATGGCTACAA CCTCC^TCC 
CGCGGGGTCT TTAGCCGXTG GCCAAGGTCO TACCGATGTT SSAGGCSAGS 

27001 TCAGGCGCCG CCGGCACTGC CCGTTCGCCG ACCCAACCGT Afi&TGGGACA 

agtSgcggc ggccgtgacg ggcaagcggc tgggttggca tctacoctot 

~27051 "^CTGGAAC CAG3GCCGGT AAGTCCAAGC AGCCGCCGCC GmAGCCCAA 

ggtgaccttg gtcccggcca ttcrggttcg tcggcggcgg caatcgggtt 

27101 GAGCAACAAC AGCGCCAAGG CTACCCCTCA TGGCGCGGGC ^CAAGAS^GC 
CTCGTTGTTG TCGCGCrTCC G&TCGCGAGT ACCGCGCCCG TGTTCTTGCG 

27151 CATAGTTGCT TGCTTGCAAG ACTGTGGGGG CAACATCTCC TTCGCCCGCC 
GTATCAACGA ACGAACGTTC TGACACCCCC GTTGTAGAGG AAGCGGGCGG 

^>T5ni CPTTOCTTCT CTACCATCAC GGCGTGGCCT TCCCCCGTAA CATCCTGCAT 
cSSSI StLtAOTG CCGCACCGGA AGGGGGCATT GTAGGACGTA 

27251 TACTACCSTC ATCTCTACAG CCCATACTGC ACCGGCGGCA GCGGCAGCAA 
ATSATGGCAG TAGAGATCTC GGGTATGACG TGGCCGCCGT CGCCGTCGTT 

^•J-^ni eASCAQCOGC CACACAGAAG CAAAGGCGAC CGGATAGCAA 6ACTCTGACA 
GTCCTCGCCG 6TGTGTCTTC GTTTCCGCTC GCCTATCGTT CTGAGACTGT 

27351 AAGCCCAAGA AATCCACAGC GGCGGCAGCA GCAGGAG6AG GAGCGCTGCC 
TTCSGGTTCT TTAGGTCTCG CCGCC6TCGT CGTCCTCCTC CTCGCGACGC 

27401 TCTGGCGCCC A&C6AACCCG TATCGACCCG CGAGCTTAGA ";CA^TTT 
AGACCOCGGG TTCCTTGGGC ATAGCTGGGC GCTCGAATCT TTGTCCTAAA 

27451 TTCCCACTCT GTATGCTATA TTTCAACAGA GCAGQGGCCA AGAACAMAG 
AAOGGTGAGA CATACGATAT AAAGTTGTCT CGTCCCCGGT TCTTGTTCTC 
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27501 CTGAAAATAA Ji^.CAGGTC TCTGCGATCC CTCACCCGCA GCTGCq»A 
GACTTTTATT ^mCTCCAG A6ACGCTAG6 GAGTGGGCGT CGACGgJHaT 

27551 TCACAAAAGC GaAGATCAOC TTCG0C6CAC GCTGGAAGAC GCGGAGGCTC 
AGTGTTTTCG CTTCTAGTCG AAGCCGCGTG OGACCTTCTG CGCCTCCGAG 

27601 TCTTCAGTAA ATACTGCGC6* CTGACTCTTA AGGACTA6TT TC6CGCCCTT 
AGAACTCATT TATGACGCGC GACT6AGAAT TCCTGATCAA AGCGCGGGAA 

27651 TCTCAAATTT AAGCGCGAAA ACTACGTCAT CTCCAGCGGC CACACCCGGC 
AGAGTTTAAA TTCGCGCTTT TQATGCAGTA GAGGTCGCCG GTGTG6GCCG 



27701 GCCAGCACCT GTTGTCAGCG CCATTATGAG CAAGGAAATT CCCACGCCCT 
CGGTCGTGGA CAACAGTCGC GGTAAIACTC GTTCCTTTAA GGGTGCG6GA 

27751 ACATGTGGAQ TTACCAOCCA CAAATGGGAC TTGCGGCTGG AGCTGCCCAA 
TCTACACCTC AATGGTCG6T 6TTTACCCTG AACGCCGACC TCSACGGGTT 

27801 GACTACTCAA CCCGAATAAA CTACATGAGC GCGGGACCCC ACATGATATC 
CTGATGAGTT GOGCTTATTT GATGTACTCG CGCCCTGGGG TGTACTATAG 

• 27851 CCGGGTCAAC GGAATACGCG CCCACCGAAA CCGAATTCTC CTGGAACAGG 
GGCCCAGTTC CCTTATCCGC GGGTGGCTTT GGCTTAAGAG GACCTTGTCC 

27901 CGGCTATTAC CACCACACCT CGTAATAACC TTAATCCCCG TAGTTGGCCC 
GCCGATAATG GT6GTGTGGA GCATTATT6G AATTAGGGGC ATCAACCGGG 

27951 GCTGCCCTGG TGTACCAGGA AAGTCCCGCT CCCACCACTG TGGTACTTCC 
CGACGGGACC ACATGGTCCT TTCAGGGCGA GGGTGGTGAC ACCATGAAGG 

28001 CAGAGACGCC CAGGCCGAAG TTCAGATGAC TAACTCAGGG GCGCAGCTTG 
GTCTCTGCGG GTCCGGCTTC AAGTCTACTG ATTGAGTCCC CGCGTCGAAC 

28051 CGGGCGGCTT TCGTCACAGG GTGCGGTCGC CCGGGCAGGG TATAACTCAC 
GCCCGCCGAA AGCAGTGTCC CACGCCAGCG GGCCCGTCCC ATATTGAGTG 

28101 CTGACAATCA GAGGGCGAGG TATTCAGCTC AACGACGAGT CGGTGAGCTC 
GACTGTTAGT CTCCCGCTCC ATAAG?CGAG TTGCTGCTCA GCCACTCGAG 

28151 CTCGCTTGGT CTCCGTCCGG ACGGGACATT TCAGATCGGC GGCGCCGGCC 
GAGCGAACCA GAGGCAGGCC TGCCCCCTAA AGTCTAGCCG CCGCGQCCGG 

28201 GCTCTTCATT CACGCCTCGT CAGGCAATCC TAACTCTGCA GACCTCGTCC 
CGAGAAGTAA GTGCGGAGCA GTCCGTTAGG ATTGAGACGT CTGGAGCAGG 

28251 TCTGAGCCGC GCTCTGGAGG CATTGGAACT CTGCAATTTA TTGAOQAQTT 
AGACTC6GCG CGAGACCTCC GTAACCTTCA GACGTTAAAT AACTCCTCAA 

28301 TGTGCCATCG GTCTACTTTA ACCCCTTCTC OGGACCTCCC GGCCACTATC 
ACACGGTAGC CAGATGAAAT TGGGGAAGAG CCCTOGAGGG CCOOTGATAG 

2B351 CGGATCAATT TATTCCTAAC TTTGACGCG6 TAAAGGACTC OOCOQACGGC 
GCCTAGTTAA ATAAGGATTG AAACTGCGCC ATOTCCT6AG CC6CCTGCCG 

28401 TACGACTQAA TQTTAAGTG6 AGACGCAGAC CAACTGCGCC TGAAACACCT 
ATGCTGACTT ACAATTCACC TCTCCGTCTC GTTOACOCGG ACTTTQTG6A 
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28451 GCTCCACTGT dWtGCCACA AGTCCTTTGC CCGCGkCTCC GGTGAdWT 
CCAGGTGACA GCGGCGGTGT TCACGAftACG GGCGCTGAGG CCACTCAAAA 

28501 GCTACTTTGA ATTGCCCGAG GATCATXTCG AGGGCCCGGC GCACGGCGTC 
CGATGAAACT TAACGGGCTC CTAGTATAGC TCCCG6GCCG CGTGCCGCAG 

28551 CGGCTTACCG CCCAG6GAGA GCTTGCCCGT AGCCTGATTC GGGAGTTTAC 
GCCGAATGGC OGGTCCCTCT CGAACGGGCA TCGGACTAAG CCCTCAAATG 

28601 CCAGCX5CCCC CTGCTAGTTG AGCGGGACAG GGGACCCTGT GTTCTCACTG 
GGTCGCGGGG GACGATCAAC TCGCCCTGTC CCCTGGGACA CAAGAGTGAC 



28651 TGATTTGCAA CTGTCCTAAC CCTGGATTAC ATCAA6ATCT TTGTTGCCAT 
ACTAAACGTT GACAGGATTG GGACCTAATG TAGTTCTAGA AACAACGGTA 

2B701 CTCTGTGCTG AGTATAATAA ATACAGAAAT TAAAATATAC TGGGGCTCCT 
GAGACACGAC TCATATTATT TATGTCTTTA ATTTTATATG ACCCCGAGGA 

28751 ATCGCCATCC TGTAAACGCC ACCGTCTTCA CCCGCCCAAG CAAACCAAGG 
TAGCGGTAGG ACATTTGCGG TGGCAGAAGT GGGCGGGTTC GTTTGGTTCC 

28801 CGAACCTTAC CTGGTACTTT TAACATCTCT CCCTCTGTGA TTTACAACAG 
GCTTGGAATG GACCATGAAA ATTGTAGAGA GGGAGACACT AAATGTTGTC 

28651 TTTCAACCCA GACGGAGTGA GTCTACGAGA GAACCTCTCC GAGCTCAGCT 
AAAGTTGGGT CTGCCTCACT CAGAT6CTCT CTTGGAGACG CTCGAGTCGA 

28S01 ACTCCATCAG AAAAAACACC ACCCTCCTTA CCTGCCGGGA ACGTACGAGT 
TGAGGTAGTC TTTTTT6TGG TGGGAGGAAT GGACGGCCCT TGCATGCTCA 



28^51 GCGTCACCGG CCQCTGCACC ACACCTACCG CCTGACCGTA AACCAGACTT 
CGCAGTGGCC GGOGACGTGG TGTGGATGGC GGACTGGCAT TTGGTCTGAA 

29001 TTTCCGGACA 6ACXTCAATA ACTCT6TTTA CCAGAACAGG AGGTGAGCTT 
AAAGGCCTGT CTG6AGTTAT TGAGACAAAT GGTCTTGTCC TCCACTCGAA 

29051 A6AAAACCCT TAGGGTATTA GGCCAAAGGC GCAGCTACT6 TGGGGTTTAT 
TCTTTTGGGA ATCCCATAAT CCGGTTTCC6 CGTCGATGAC ACCCCAAATA 

29101 GAACAATTCA AGCAACTCTA CGGGCTATTC TAATTCA6GT TTCTCTAGAA 
CTTGTTAAGT TCGTTGAGAT GCCCGATAA6 ATTAAGTCCA AAGAGATCTT 

29151 TCG6GGTTCG GGTTATTCTC TGTCTTGTGA TTCTCTTTAT TCTTATACTA 
AGCCCCAACC CCAATAAGAG ACAGAACACT AAGAGAAATA AGAATATGAT 

29201 ACGCTTCTCT GCCTAAGGCT CGCCGCCTGC TGTGTGCACA TTTGCATTTA 
TGCGAA6AGA CGGATTCCGA GCGGCGGACG ACACACGTGT AAACGTAAAT 

29251 TTGTCAGCTT TTTAAACGCT GGGGTCGCCA CCCAAGATGA TTAGGTACAT 
AACAGTCGAA AAATTTGC6A CCCCAGCGGT GGGTTCTACT AATCCATGTA 

29301 AATCCTAGGT TTACTCACCC TTGCGTCAGC CCACGGTACC ACCCAAAAGG 
TTACGATCCA AATGAGT666 AACGCAGTCG GGTGCCATGG TGGGTTTTCC 

29351 TCGATTTTAA GGAGCCAGCC TGTAATGTTA CATTCGCAGC TGAAGCTAAT 
ACCTAAAATT CCTCGGTCGG ACATTACAAT GTAAGCGTCG ACTTCGATTA 
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29401 GAGTGCACCA CMTTATAAA ATGCACCACA GAACATCAAA AWtrJ\iK«mT 
CTCACGTGGT GAGAATATTT TAC6TGGTGT CTTGTACTTT TCQACG^&A 

29451 TCGCCACAAA AACAAAATTG GCAAGTATCC TQTTTATGCT ATTTGGCAGC 
AGCGGTGTTT TTGTTTCAAC CGTTCATACG ACAAATACGA TAAACCGTCG 

29501 CAGGTGACAC TACAGAGTAT AAT6TTACAG TTTTCCAGQG TAAAAGTCAT 
GTCCACTGTG ATGTCTCATA TTACAATGTC AAAAGGTCCC ATTTTCAGTA 

29551 AAAACTTTTA TGTATACTTT TCCATTTTAT GAAATGTGCG ACATTACCAT 
TTTTGAAAAT ACATAT6AAA AGGTAAAATA CTTTACACGC TGTAATGGTA 



29601 GTACATGAGC AAACAOTATA AGTTGT6GCC CCCACAAAAT TGTGTGGAAA 
CATGTACTCG TTTGTCATAT TCAACACCQQ GGGTOTTTTA ACACACCTTT 

29651 ACACTGGCAC TTTCTOCTOC ACTGCTATOC TAATTACAGT GCTCGCTTTG 
TGTOACCGTG AAASACGACG TGACGATACG ATTAAT6TCA CGA6CGAAAC 

29701 GTCTGTACCC TACTCTATAT TAAATACAAA AGCAGACGCA GCTTTATTGA 
CAGACATGGG ATGAGATATA ATTTATGTTT TCGTCT6CGT CGAAATAACT 

29751 GGAAAAGAAA ATGCCTTAAT TTACTAAGTT ACAAAGCTAA TGTCACCACT 
CCTTTTCTTT TACGGAATTA AATGATTCAA TGTTTC6ATT ACAGTGGTGA 

29801 AACTGCTTTA CTCGCTCCTT GCAAAACAAA TTCAAAAAGT TAGCATTATA 
TTGACGAAAT GAGCGACGAA CGTTTTGTTT AAGTTTTTCA ATCGTAATAT 

29851 ATTAGAATAG GATTTAAACC CCCCG6TCAT TTCCTGCTCA ATACCATTCC 
TAATCTTATC CTAAATTTGG GGGGCCAGTA AAGGACGAGT TATGGTAAGG 

29901 CCTGAACAAT "TGACTCTATG TGGGATATGC TCCAGC6CTA CAACCTTGAA 
GGACTTGTTA ACTGAGATAC ACCCTATACG AGGTC6CGAT GTTGGAACTT 

29951 GTCAGGCTTC CTGGATGTCA GCATCTGACT TTGGCCAGCA CCTGTCCCGC 
CAGTCCGAA6 GACCTACAGT CGTAGACTGA AACCGGTCGT GGACAGGGCG 

30001 GGATTTGTTC CAGTCCAACT ACAGCGACCC ACCCTAACAG AGATGACCAA 
CCTAAACAAG GTCAGGTTGA TGTCGCTGGG TGGGATTGTC TCTACTGGTT 

30051 CACAACCAAC GCGGCCGCCG CTACCGGACT TACATCTACC ACAAATACAC 
GTGTTGGTTC CGCCGGCGGC GATGGCCTGA ATGTAGATGG TGTTTATGTG 

30101 CCCAAGTTTC TGCCTTTGTC AATAACTGGG ATAACTTGGG CAT6TGGTGG 
GGGTTCAAAG ACGGAAACAG TTATTGACCC TATTGAACCC GTACACCACC 

30151 TTCTCCATAG CGCTTATGTT TGTATGCCTT ATTATTATGT GGCTCATCTG 
AAGAGGTATC GCGAATACAA ACATACGGAA TAATAATACA CCGAGTAGAC 

30201 CTGCCTAAAG CGCAAACGCG CCCGACCACC CATCTATAGT CCCATGATTG 
GACGGATTTC GCGTTTGCGC GGGCTGGTGG GTAGATATCA GGGTAGTAAC 

30251 TGCTACACCC AAACAATGAT QGAATCCATA GATTGGACOQ ACTGAAACAC 
ACGATGTGGG TTTGTTACTA CCTTASGTAT CTAACCTGCC TGACTTTGTG 

30301 ATGTTCTTTT CTCTTACAGT ATGATTAAAT GAGACATGAT TCCTCGAGTT 
TACAA6AAAA GAGAATGTCA TACTAATTTA CTCTGTACTA AGGAGCTCAA 
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30351 TTTATATTAC iBpCCTTGT TSCGCTTTTT TGTGCGTGCT CCACAT^C 
AAATATAATG ACTOGGAACA ACGCGAAAAA ACACGCACGA QGTGTAfiCcG 

30401 TGCGGTTTCT CACATCGAAG TAGACTGCAT TCCAGCCTTC ACAGTCTATT 
ACGCCAAAGA GTGTA6CTTC ATCTGACGTA AGGTCGGAAG TGTCA6ATAA 

30451 T6CTTTACGG ATTTGTCACC CTCACGCTCA TCTGCAGCCT CATCACTCT6 
ACGAAATGCC TAAACAGTCG GAGTGCSAGT AGACGTCGGA GTAGTGACAC 

30501 GTCATCCCCT TTATCCAGTC CATTGACT6G GTCTGTGTGC GCTTTGCATA 
CAGTAGCG6A AATA6GTCAC GTAACTGACC CAGACACAC6 CX3AAACGTAT 



30551 TCTCAGACAC CATCCCCAGT ACAGGGACAG GACTATAGCT GAGCTTCTTA 
AGA6TCTGTG 6TAGGGGTCA TGTCCCTGTC CTGATATCGA CTCGAAGAAT 

30601 GAATTCTTTA ATTATQAAAT TTACTGTGAC TTTTCTGCTG ATTATTTGCA 
CTTAAGAAAT TAATACTTTA AATCACACTG AAAAGACGAC TAATAAACGT 

30651 CCCTATCTGC GTTTTCTTCC CCGACCTCCA AGCCTCAAAG ACATATATCA 
GGGATAGACG CAAAACAAGG GGCTGGAGGT TCGGAGTTTC TGTATATAGT 

30701 TGCAGATTCA CTCGTATATG GAATATTCCA AGTTGCTACA ATGAAAAAAG 
ACGTCTAAGT GAGCATATAC CTTATAAGGT TCAACGATGT TACTTTTTTC 

30751 CGATCTTTCC GAAGCCTGGT TATATGCAAT CATCTCTGTT ATGGTGTTCT 
GCTAGAAAGG CTTCGGACCA ATATACGTTA GTAGAGACAA TACCACAAGA 

30801 GCAGTACCAT CTTAGCCCTA GCTATATATC CCTACCTTGA CATTGGCTGG 
CGTCATGGTA GAATCGGGAT CGATATATAG GGATGGAACT GTAACCGACC 

30B51~AACGCAATAG ATGCCATGAA CCACCCAACT TTCCCCGCGC-CCGCTATGCT 
TTGCGTTATC TACGGTACTT GGTGGGTTGA AAGGGGCGCG GGCGATACGA 

30901 TCCACTGCAA CAAGTTGTTG CCGGCGGCTT tgtcccagcc aatcagcctc 

aggtgacgtt gttcaacaac ggccgccgaa acagggtcgg ttagtcggag 

30951 gcccaccttc tcccaccccc actgaaatca gctactttaa tctaacagga 
cgggtggaag agggtggggg tgactttagt cgatgaaatt agattgtcct 

31001 ggagatgact gacaccctag atctagaaat ggacggaatt attacagagc 
cctctactga ctgtgggatc tacatcttta cctgccttaa taatgtctcg 

31051 agcgcctgct agaaagacgc agggcagcgg ccgaccaaca gcgcatgaat 

TCGCGGACGA TCTTTCTGCG TCCCSTC6CC GCCTCGTTGT CGC6TACTTA 

31101 CAAGAGCTCC AAGACATGGT TAACTTGCAC CAGTGC AAAA GGGGTATCTT 
GTTCTCGAQ6 TTCTGTACCA ATTGAACGTC GfTCACGTTTT CCCCATAGAA 

31151 TTGTCTCGTA AAGCAGGCCA AAGTCACCTA CGACAGTAAT ACCACCGGAC 
AACA6AGCAT TTCSTCCGGT TTCAGTGGAT GCTGTCATTA TGGTGGCCTG 

31201 ACCGCCTTAG CTACAAGTTG CCAACCAAGC GTCAGAAATT CGTGGTCATG 
T6QCGGAATC GATGTTCAAC G6TTG6TTCG CAGTCTTTAA CCACCAGTAC 

31251 GTGGGAGAAA AGCCCATTAC CATAACTCAG CACTCGGTAG AAACCGAAGG 
CACCCTCTTT TCGGGTAATG GTATTGAGTC GTGAGCCATC TTT6GCTTCC 
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31301 CTGCATTCAC rj^rCTTGTC AAGGACCTGA GCATCTCTGC ACCCTq|»A 
GACGTAAGTC aHgGAACAG TTCCTGGACT CCTAQAGACG TGGGAaSRaT 

31351 AGACCCTGTG CG6TCTCAAA GATCTTATTC CCTTTAACTA AT AAAAAAA A 
TCTGGGACAC GCCAGAGTTT CTAGAATAAG GGAAATTCAT TATTTTTTTT 

31401 ATAATAAAOC ATCACTTACT TAAAATCAGT TAGCAAATTT CTGTCCAGTT 
TATTATTTCG TAGTGAATGA ATTTTAGTCA ATCGTTTAAA GACAGGTCAA 

31451 TATTCAGCAG CACCTCCTTG CCCTCCTCCC A6CTCTGGTA TTGCAGCTTC 
A7AAGTCGTC GTGGAGGAAC GGGA6GAGGG TCGAGACCAT AACGTCGAAG 



31501 CTCCTGGCTG CAAACTTTCT CCACAATCTA AATGGAATGT CAGTTTCCTC 
GAGGACCGAC OTTTGAAAQA GGTGTTAGAT TTACCTTACA GTCAAAGGAG 

31551 CTGTTCCTGT CCATCCGCAC CCACTATCTT CATGTTGTTG CACATGAAGC 
6ACAAGGACA G6TAG6CGT6 GGTGATAGAA GTACAACAAC GTCTACTTCG 

31601 GCGCAAGACC GTCTGAAQAT ACCTTCAACC CCGTGTATCC ATATGACACG 
CGCGTTCT6G CAGACTTCTA TGGAAGTTGG GGCACATAGG TATACTGTGC 

31651 GAAACCGGTC CTCCAACTGT GCCTTTTCTT ACTCCTCCCT rTGTATCCCC 
CTTTCGCCAC CAGGTTGACA CGGAAAAGAA TGAGGAGGGA AACATAGGG6 

31701 CAATCGCTTT CAAGA6AGTC CCCCTGGGGT ACTCTCTTTG CGCCTATCCG 
GTTACCCAAA GTTCTCTCAG GGGGACCCCA TGAGAGAAAC GCGGATAGGC 

31751 AACCTCTAGT TACCTCCAAT GGCATGCTTG CGCTCAAAAT GGGCAACGGC 
TTGGAGATCA ATGGAGGTTA CCGTACGAAC GCGAGTTTTA CCCGTTGCCG 

31B01 CTCTCTCTGG ACC3AGGCCGG CAACCTTACC TCCCAAAATG TAACCACTGT 
GAGAGAGACC TGCTCCGGCC GTTGGAATGG AGGGTTTTAC ATTGGTGACA 

31851 GAGCCCACCT CTCAAAAAAA CCAAGTCAAA CATAAACCTG GAAATATCTG 
CTCGGGTGGA GAGTTTTTTT GGTTCAGTTT GTATTTGGAC CTTTATAGAC 

31901 CACCCCTCAC AGTTACCTCA GAAGCCCTAA CTGTGGCTGC CGCCGCACCT 
GTGGGGAGTG TCAATGGAGT CTTCG3GATT GACACCGACG GCGGCGTGGA 

31951 CTAATGGTCG CGGGCAACAC ACTCACCATG CAATCACAGG CCCCGCTAAC 
GATTACCAGC GCCCGTTGTG TGAGTGGTAC GTTAGTGTCC GGGGCGATTG 

32001 CGTGCACGAC TCCAAACTTA GCATTGCCAC CCAAGGACCC CTCACAGTGT 
GCACGTGCTG AGGTTTGAAT CGTAACGGTG 6GTTCCT66G GAGTGTCACA 

32051 CAGAAGGAAA GCTAGCCCTG CAAACATCA6 GCCCCCTCAC CACCACCGAT 
GTCTTCCTTT CGATCCCCAC GTTTGTAGTC CGGGGGAGTG GTGGTGGCTA 

32101 AGCAGTACCC TTACTATCAC TGCCTCACCX; CCTCTAACTA CTGCCACT6G 
TCGTCATGGG AATGATAGTG ACGeaGTGGO GGAGATTGAT GACGGT6ACC 

32151 TAGCTTGGGC ATTGACTTGA AAGAGCCCAT TTATACACAA AATGGAAAAC 
ATCGAACCCG TAACTGAACT TTCTCGGGTA AATATGTGIT TTACCTTTTG 

32201 TAGGACTAAA GTACGGGGCT CCTTTGCATG TAACAGAC6A CCTAAACACT 
ATCCTGATTT CATGCCCCGA GGAAACGTAC ATTGTCTQCT GGATTTGTGA 
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32251 TTGACCGTAG M|CTGGTCC AGGTGTGACT ATTAATAj*MH:1TCd|||!:A 
AACTGGCATC ■tGACCAGG TCCACACTGA TAATTATTAT GA&GG^PsT 

32301 AACTAAAGTT ACTGGAGCCT TGGGTTTTGA TTCACAAGGC aatatgcaac 
TTGATTTCAA TGACCTCGGA ACCCAAAACT AAGTGTTCCG XTATACGTTG 

32351 TTAATGTAGC AGGAGGACTA AGGATTGATT CTCAAAACAG ACGCCTTATA 
AATTACATCG TCCTCCTGAT TCCTAACTAA GAGTTTTGTC TGCGGAATAT 

32401 CTTGATGTTA GTTATCCGTT TGATGCTCAA AACCAACTAA ATCTAAGACT 
GAACTACAAT CAATAGGCAA ACTACGAGTT TTGGTTGATT TAGATTCTGA 



32451 AGGACAGGGC CCTCTTTTTA TAAACTCAGC CCACAACTTG GATATTAACT 
TCCTGTCCCG GGAGAAAAAT ATXTGAGTCG GGTGTTGAAC CTATAATTGA 

32501 ACAACAAAGG CCTTTACTT6 TTTAC7W3CTT CAAACAATTC CAAAAACCTT 
TGTTGTTTCC GGAAATGAAC AAATGICGAA GTTTGTTAA6 GTTTTTCGAA 

32551 GAGGTTAACC TAAGCACTGC CAAGGGGTTG ATGTTTGACG CTACAGCCAT 
CTCCAATTGG ATTCGTGACG GTTCCCCAAC TACAAACTGC GATGTCGGTA 

32601 AGCCATTAAT GCAGG^GATG GGCTTGAATT TGGTTCACCT AATGCACCAA 
rCCGTAATTA CCTCCTCTAC CCGAACTTAA ACCAAGTGGA TTACGTGGTT 

32651 ACACAAATCC CCTCAAAACA AAAATTGGCC AT6GCCTAGA ATTTGATTCA 
TGTGTTTA6G SGAGTTTTGT OTTTAACCGG TACCG6ATCT TAAACTAAGT 

32701 AACAAGGCTA TGGTTCCTAA ACTAGGAACT GGCCTTAGTT TTGACAGGAC 
TTGTTCCGAT ACCAAGGATT TGATCCTTGA CCGGAATCAA AACTGTCGTG 

-32751— AGGTGCCATT- ACAGTAGGAA ACAAAAATAA^TGATAAGCTA- ACTTTGTGGA 
TCCACGGTAA TGTCATCCTT TGTTTTTATT ACTATTCGAT TGAAACACCT 

32B01 CCACACCAGC TCCATCTCCT AACTGTAGAC TAAATGCAGA GAAAGATGCT 
GGTGTGGTCG AGGTAGAGGA TTGACATCTG ATTTACGTCT CTTTCTACGA 

32851 AAACTCACTT TGGTCTTAAC AAAATGTGGC AGTCAAATAC TTGCTACAGT 
TTTGAGTGAA ACCAGAATTG TTTTACACCG TCAGTTTATG AACGATGTCA 

32901 TTCAGTTTTG GCTGTTAAAG GCAGTTTGGC TCCAATATCT GGAACAGTTC 
AAGTCAAAAC CGACAATTTC CGTCAAACC6 AGGTTATAGA CCTTGTCAA6 

32951 AAAC3TGCTCA TCTTATTATA AGATTTGACG AAAATGGAGT GCTACTAAAC 
TTTCACGAGT AGAATAATAT TCTAAACTGC TTTTACCTCA CGATGATTTG 

33001 AATTCCTTCC TGGACCCAGA ATATTGGAAC TTTAGAAATG 6AGATCTTAC 
TTAAGGAAGG ACCTGGGTCT TATAACCTTG AAATCTTTAC CTCTA6AATG 

33051 T6AAG6CACA GCCTATACAA ACGCTGTTGG ATTTATGCCT AACCTATCAG 
ACTTCCGTGT CGGATATGTT TGCGACAACC TAAATACGGA TTGGATAGTC 

33101 CTTATCCAAA ATCTCACGGT AAAACTGCCA AAAGTAACAT TGTCAGTCAA 
GAATAGGTTT TAGAGTGCCA TTTTGACGGT TTTCATTGTA ACAGTCAGTT 

33151 GTTTACTTAA ACGGAGACAA AACTAAACCT GTAACACTAA CCATTACACT 
CAAATGAATT TGCCTCTGTT TTGATTTGGA CATTGTGAT? GGTAATGTGA 
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33201 AAAC66TACA C^BAAACAG GAGACACAAC TCCAAGTG&A 7£Cte'fitt|T 
TTTGCCATGT CTWTTTGTC CTCTGTGTTG AGGTTCACCT ATSAG^HKa 

33251 CATTTTCATG G6ACTGGTCT GGCCACAACT ACATTAATGA AATATTTGCC 
GTAAAA(3TAC CCTGACCAGA CC6GTGTTGA TGTAATTACT TTATAAAC6G 

33301 ACATCCTCTT ACACTTTTTC ATACATTGCC CAAGAATAAA GAATCGTTTG 
TGTAGGAGAA TGTGAAAAAG TATGTAAC6G GTTCTTATTX CTTAGCAAAC 

33351 TGTTATGTTT CAACGT6TTT ATTTTTCAAT TGCAGAAAAT TTCAAGTCAT 
ACAATACAAA GTTGCACAAA TAAAAAGTTA ACGTCTTTTA AAGTTCAGTA 



33401 TTTTCATTCA GTAGTATAGC CCCACCACCA CATAGCTTAT ACAGATCACC 
AAAAGTAAGT CATCATATCG GGGTGGTG6T GTATCGAATA TGTCTAGTGG 

33451 GTACCTTAAT CAAACTCACA GAACCCTAGT ATTCAACCTG CCACCTCCCT 
CATGGAATTA GTTTGAGTGT COTGGGATCA TAAGTTGGAC 6CTGGAGGGA 

. 33501 CCCAACACAC AGAGTACACA GTCCTTTCTC CCCGGCT6GC CTTAAAAAGC 
GGGTTGTGTG TCTCAT6TGT CAGGAAAGAG GGGCCGACCG GAATTTTTCG 

33551 ATCATATCAT GGGTAACAGA CATATTCTTA GGTGTTATAT TCCACACGGT 
TAGTATAGTA CCCATTGTCT GTATAAGAAT CCACAATATA AGGTGTGCCA 

33601 TTCCTGTCGA GCCAAACGCT CATCAG'TGAT ATTAATAAAC TCCCCGGGCA 
AAGGACAGCT CCGTTT6CGA GTA3TCACTA TAATTATTTG AGGGGCCCGT 

33651 GCTCACTTAA CTTCATGTCG CTGTCCAGCT GCTGAGCCAC AQGCTGCTGT 
CGAGTGAATT CAAGTACAGC GACAGGTCGA CGACTCGGTG TCCGACGACA 

33701 CCAACTTGCG GTTGCTTAAC GGGCGGCGAA GGAGAAGTCC ACGCCTACAT 
GGTTGAACGC CAACGAATTG CCCGCCGCTT CCTCTTCAGG TGCGGATGTA 

33751 GGGGGTAGAG TCATAATCGT GCATCAGGAT AGGGCGGTGG TGCTGCAGCA 
CCCCCATCTC AGTATTAGCA CGTAGTCCTA TCCCGCCACC ACGACGTCGT 

33801 GCGCGCGAAT AAACTGCTGC CGCCGCCGCT CCGTCCTGCA GGAATACAAC 
CGCGCGCTTA TTTGACGACG GCQGCGCCGA GGCAGGACGT CCTTATGTTG 

33851 ATGGCAGTGG TCTCCTCAGC GATGATTCGC ACCGCCCGCA GCATAAGGCG 
TACC6TCACC AGAGGAGTCG CTACTAAGCG TGGCGGGCGT CGTATTCCGC 

33901 CCTTGTCCTC CGGGCACAGC AGCOCACCCT GATCTCACTT AAATCAGCAC 
GGAACAGGAG GCCC6TGTCG TCGCGTCGGA CTAGAGTGAA TTTAGTCGTG 

33951 AGTAACTGCA GCACAGCACC ACAATATTGT TCAAAATCCC ACAGTGCAAG 
TCATTGACGT CSTGTCGTGG TGTTATAACA AGTTTTAGG6 TGTCACGTTC 

34001 GCGCTGTATC CAAAGCTCAT G6CGGGGACC ACAGAACCCA OOTGGCCATC 
CGCGACATAG GTTTCGAGTA CCGCCCCTGG TGTCTTGG6T GCACCQQTAQ 

34051 ATACCACAAG CGCAGGTAGA TTAAGTGQCG ACCCCTCATA AACACGCT6G 
TAT6GTGTTC GCGTCCATCT AATTCACCX3C TGGGGAiSTAT TTGTGCGACC 

34101 ACATAAACAT TACCTCTTTT GGCATGTTGT AATTCACCAC CTCCC6GTAC 
TGTATTTGTA ATOGAGAAAA CCGTACAACA TTAAGTGGTG GA6GGCCATG 
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34151 CATATAAACC MBCATTAAA CATGGCGCCA TCCACCACCA~TCCTMM|A ' 
GTATATTTGG otRtAATTT GTACCQCOGT AGGTGGTGGT AGGAtJKt 

34201 gctggccaaa acctgcccgc cggctataca ctgcagggaa ccgggactgg 
cgaccggttt tggacgggcg gccgatatgt gacgtccctt ggccctgacc 

34251 aacaatgaca gtggagagcc caggactc6t aaccatggat catcatgctc 
ttgttactgt cacctctcgg gtcctgagca ttggtaccta gtagtacgag 

34301 gtcatgatat caatgttggc acaacacagg cacacgtgca tacacttcct 

CAGTACTATA GTTACAACCG TGTTGTGTCC GTGTGCACGT ATGTGAAGGA 



34351 CAGGATTACA AGCTCCTCCC GCGTTAGAAC CATATCCCAG GGAACAACCC 
GTCCTAATGT TC6AGGAGGG CGCAATCTTG GTATAGGGTC CCTTGTTGGG 

34401 ATTCCTGAAT CAGCGTAAAT CCCACACTGC AGGGAAGACC TCGCACGTAA 
TAAGGACTTA GTCGCATTTA GGGTGTGACG TCCCTTCTOG AGCGTGCATT 

34451 CTCACGTTGT GCATTGTCAA AGTGTTACAT TCGGGCAGCA GCGGATGATC 
6AGTGCAACA CGTAACAGTT TCACAATGTA AGCCCGTCGT CGCCTACTAG 

34501 CTCCAGTATG GTAGCGCGGG TTTCTGTCTC AAAAGGAGGT AGACGATCCC 
GAGGTCATAC CATCGCGCCC AAAGACAGAG TTTTCCTCCA TCTGCTAGGG 

34551 TACTGTACGG AGTGCGCCGA GACAACCGAQ ATCGT6TTGG TCGTAGTGTC 
ATGACAT6CC TCACGCG6CT CTGTTGGCTC TAGCACAACC AGCATCACAG 

34601 ATGCCAAATG GAACGCCG6A CGTAGTCATA TTTCCT6AAG CAAAACCAGG 
TACGGTTTAC CTTGCGGCCT GCATCA6TAT AAAGGACTTC GTTTTGGTCC 

-34651 TGGGGGCGTG-ACAAACAGAT-^CTGCGTCTCC-GGlCTCGCrG-CTTAGATCGC. 

ACGCCCGCAC TGTTTGTCTA GACGCA6AGG CCACAGCGGC GAATCTAGCG 

34701 TCTGT6TAGT A6TTGTAG7A TATCCACTCT CTCAAAGCAT CCAGGCGCCC 
AGACACATCA TCAACATCAT ATAGGTGAGA GAGTTTCGTA GGTCCGCGGG 

34751 CCTGGCTTCG GGTTCTATGT AAACTCCTTC ATGCGCCGCT GCCCTGATAA 
GGACCGAAGC CCAAGATACA TTTGAGGAAG TACGCGGCGA CGGGACTATT 

34801 CATCCACCAC CGCAGAATAA 6CCACACCCA GCCAACCTAC ACATTCGTTC 
GTASGTGGTG GCGTCTTATT CGGfTGTGGGT CGGTTGGATG TGTAAGCAAG 

34851 TGCGAGTCAC ACACGGGAGG AGCGGGAAGA GCTGGAAGAA CCATGTTTTT 
ACGCTCAGTG TGTGCCCTCC TCGCCCTICT CGACCTTCTT GGTACAAAAA 

34901 TTTTTTATTC CAAAAGATTA TCCAAAACCT CAAAAT6AAG ATCTATTAAG 
AAAAAATAAG GTTTTCTAAT AGGTTTTGGA GTTTTACTTC 1A6ATAATTC 

34951 T6AACGCGCT CCCCTCCGGT GGCGTGGTCA AACTCTACAG CCAAAGAACA 
ACTTQCGCGA GGGGA6GCCA CCGCACCAGT TTGA6ATGTC GGTTTCTTGT 

35001 6ATAATGGCA TTTGTAAGAT GTTGCACAAT GGCTTCCAAA AGGCAAAC66 
CTATTACCGT AAACATTCTA CAACGTGTTA CCGAAGGTTT TCCGTTTGCC 

35051 CCCTCACGTC CAAGTGGACG TAAAGGCTAA ACCCTTCAGG 6TGAATCTCC 
GGGAGTGCAG GTTCACCTGC ATTTCCGATT TGGGAAGTCC CACTTAGAGG 
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35101 TCTATAAACA T^fcAGCACC TTCAACCATG CCCAAATAAT TCTCATIJBG 
AGATATTTGT MTOTCGTGG AAGTTGGTAC GGGTTTATTA AGAGTAGTOC 

35151 CCACCTTCTC AATATATCTC TAAGCAAATC CCQAATATTA AGTCCG6CCA 
GGTGGAAGAG WATATAGAG ATTCGTTTA6 GGCTTATAAT TCAGGCCGGT 

35201 TTGTAAAAAT CTGCTCCAGA GCCCCCTCCA CCTTCAGCCT CAA6CAGCGA 
AACATTTTTA GACGAGGTCT CGCOGGAGGT 6GAAGTC6GA GTTCGTCGCT 

35251 ATCATGATTC CAAAAATTCA GGTTCCTCAC AGACCTGTAT AAGATTCAAA 
TAGTACTAAC GTTTTTAAGT CCAAGGAGTG TCTGGACATA TTCTAAGTTT 



35301 AGC6GAACAT TAACAAAAAT ACCGCGATCC CGTAGGTCCC TTCGCAGGGC 
TCGCCTTCTA ATTGTTTTTA TCGCGCTAG6 GCATCCAGGG AAGCGTCCCG 

35351 CAGCTGAACA TftATCGTGCA GGTCTGCACG GACCAGCGCG GCCACTTCCC 
GTCGACTTGT ATTAQCACGT CCAGACGTGC CTGGTCGCGC CGGTGAA6GG 

35401 CGCCAGGAAC CATGACAAAA GAACCCACAC TGATTATGAC ACGCATACTC 
GCGGTCCTTG GTACTGTTTT CTTGGGTGTG ACTAATACTG TGCGTATGAG 

35451 GGAGCTATGC TAACCAGCGT AGCCCCGATG TAAGCTTGTT GCATGGGCGG 
CCTCGATACG ATTGGTCGCA TCG6GGCTAC ATTCGAACAA CGTACCCGCC 

35501 CGATATAAAA TGCAAGGTGC TGCTCAAAAA ATCAGGCAAA GCCTCGCGCA 
GCTATATTTT ACGTTCCACG ACGAGTTTTT TAGTCCGTTT CGGAGCGCGT 

35551 AAAAAGAAAG CACATCGTAG TCATGCTCAT GCAGATAAAG GCAGGTAAGC 
TTTTTCTTTC GTGTAGCATC AGTACGAGTA CGTCTATTTC CGTCCATTCG 

35601 TCCGGAACCA CCACAGAAAA AGACACCATT TTTCTCTCAA ACAT6TCTGC 
AGGCCTTGCT GGTCTCTTTT TCTGTCGTAA AAAGAGAGTT TGTACAGACG 

35651 GGGTTTCTGC ATAAACACAA AATAAAATAA CAAAAAAACA TTTAAACATT 
CCCAAAGACG TATTTGTGTT TTATTTTATT GTTTTTTTGT AAATTTGTAA 

35701 AGAAGCCTGT CTTACAACAG GAAAAACAAC CCTTATAAGC ATAAGACGGA 
TCTTCGGACA GAATGTTGTC CTTTTTGTTG GGAATATTCG TATTCTGCCT 

35751 CTACGGCCAT GCCGGCGTGA CCGTAAAAAA ACTGGTCACC GTGATTAAAA 
GATGCCGGTA CGGCCGCACT GGCATTTTTT TGACCAGTGG CACTAATTTT 

35801 A6CACCACCG ACAGCTCCTC GGTCATGTCC GGAGTCATAA TGTAAGACTC 
TCGTGGTGGC TGTCGAGGAG CCAGTACAGG CCTCAGTATT ACATTCTGAG 

35851 GGTAAACACA TCAGGTT6AT TCACATCGGT CAGTGCTAAA AAGCGACCGA 
CCATTTGTGT AGTCCAACTA AGTGTAGCCA OTCACGATTT TTCGCTGGCT 

35901 AATAGCCC6G GGGAATACAT ACCOGCAQOC GTAGAGACAA CATTACAGCC 
TTATCGGGCC CCCTTAT6TA TGGGC6TCCG CATCTCTGTT GTAATOTCOG 

35951 CCCATAGGAG GTATAACAAA ATTAATAGGA GAGAAAAACA CATAAACACC 
GGGTATCCTC CATATPGTTT TAATTATOCT CTCTTTTTGT GTATTTGTGG 

36001 TGAAAAACCC TCCTGCCTAG GCAAAATAGC ACCCTCCCGC TCCAGAACAA 
ACTTTTTGGG AGGACGGATC CGTTTTATCG TGGGAGCGCC AGGTCTTGTT 
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36051 CATACAGCGC. TMACAGCG GCAGCCATAA CAGTCAGCCT TACCAG^^ 
GTATGTCGCG AAGGTGTCGC CGTCGGTATT GTCAGTCGGA ATGGTCATTT 

361D1 AAA6AAAACC TATTAAAAAA ACACCACTCG ACACGGCACC AGCTCAATCA 
TTTCrPTTGG ATAATTTTTT TGTG6TGA6C TGTGCCGT6G TCGAGTTAGT 

36X51 GTCACA6TGT AAAAAAGGGC CAAGTGCAGA GC6AGTATAT ATAGGACTAA 
CAGTGTCACA TTTTTTCCC6 GTTCACGTCT CCCTCATATA TATCCTGATT 

36201 AAAATGACGT AACGGTTAAA CTCCACAAAA AACACCCAGA AAACCGCAC6 
TTTTACTGCA TTGCCAATTT CAGGTGTTTT TTGTGGGTCT TTTOGCGTGC 



36251 CGAACCTACG OCCAGAAACG AAA6CCAAAA AACCCACAAC TTCCTCAAAT 
GCTT6GATGC QGGTCTTT6C TTTCGGTTTT TTGGGTGTT6 AA6GAGTTTA 

36301 CGTCACTTCC GTTTTCCCAC GTTACGTCAC TTCCCATTTT AAGAAAACTA 
GCAGTGAAGG CAAAAGGGT6 CAATCCAGTG AAGCCTAAAA TTCTTTTGAT 

36351 CAATTCCCAA CACATACAAG TTACTCCGCC CTAAAACCTA C6TCACCCGC 
GTTAAGGGTT GTGTATGTTC AATGAGGCGG GATTTTGGAT 6CAGTGGGCG 

36401 CCCGTTCCCA CGCCCCGCGC CACGTCACAA ACTCCACCCC CTCATTATCA 
G6GCAAGGGT GC6GGGCGCG GTGCAGTGTT TGAGGTGGGG GAGTAATAGT 



36451 TATTGGCTTC AATCCAAAAT AAGGTATATT ATTGATGATG TTAATTAAGA 
ATAACCGAAG TTAGGTTTTA TTCCATATAA TAACTACTAC AATTAATTCT 

365 01 ATTCGGATCr G CGACG CGA G GCTGG AT GGC C TTCCC CATT ATGATTCTTC 
TAAGCCTAGA CGCTGCGCTC CGACCTACCG GAAGGGGTAA TACTAAGAAG 

36551 TCGCTTCCGG CGGCATCGGG ATGCCCGCGT TGCAGGCCAT GCTGTCCAGG 
AGCGAAGGCC GCCGTAGCCC TACGGGCGCA ACGTCCGGTA C6ACAGGTCC 

36601 CAGGTAGATG AC6ACCATCA GGGACAGCTT CAAGGCCAGC AAAAGGCCAG 
GTCCATCTAC TGCTGGTAGT CCCTG7CGAA GTTCCGGTCG TTTTCCGGTC 

36651 GAACCGTAAA AAGGCCGCGT TGCTGGCGTT TTTCCATAGG CTCCCCCCCC 
CTTGGCATTT TTCCGGCGCA ACGACCGCAA AAAGGTATCC GAGGCGGGGG 

36701 CTGACGAGCA TCACAAAAAT CGACGCTCAA GTCAGAGGT6 GCGAAACCCG 
GACTGCTCGT XGTGTTTTTA GCTGCGAGTT CA6TCTCCAC CGCTTT6GGC 

36751 ACAGGACTAT AAAGATACCA GGCGTTTCCC CCTGGAAGCT CCCTCGTGCG 
TGTCCT6ATA TTTCTATGGT CCQCAAAGGG GGACCTTCGA GGGAGCACGC 

36801 CTCTCCTGTT CCGACCCT6C CGCTTACCGG ATACCTGTCC GCCTTTCTCC 
GA6AGGACAA GGCTQG6ACG 6CQAATGGCC TATGGACACG CGGAAACAGG 

36B51 CTTCGGGAAG CCTGOCOCTT TCTCATAGCT CACGCTGTA6 GTATCICAGT 
GAAGCCCTTC 6CACC6C6AA AGAGTATCGA GTGCGACATC CATAGAGTCA 

36901 TC60TGTAGG TCGTTCGCTC CAAGCTGGGC T6T6TGCACG AACCCXTCCGT 
AGCCACATCC AQCAAGCGAG GTTCGACCC6 ACACACGTGC TTGGGGGGCA 
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36951 TCAGCCCGAC Q^k^CGCCT TATCCGGTAA CTATCGTCTT" GAGTCOjMpC 
AGTCGGGCTG gRaCGCGGA ATAGGCCATT GATAGCAGAA CTCAGgPbg 

37001 CGGTAAGACA CGACTTATCG CCACTGGCAG CAGCCACT6G TAACAGGATT 
GCCATICTCT GCTGAATAGC GGTGACCGTC GTCGGTGACC ATTGTCCTAA 

37051 AGCAGAGCGA GGTATGTASG CGGTQCTACA GAGTTCTTGA AGTGGTGGCC 
TCGTCTCGCT CCATACATCC GCCACXSATGT CTCAAGAACT TCACCACCGG 

37101 TAACTACGGC OACACTAGAA GGACAGTATT TGGTATCTGC CCTCTGCTGA 
ATTGATGCCC ATCTGATCTT CCTGTCATAA ACCATAGACG CGAGACGACT 

37151 AGCCAGTTAC CTTCGGAAAA AGAGTTGGTA GCTCTTGATC CGGCAAACAA 
TCGGTCAATG 6AAGCCTTTT TCTCAACCAT CGAGAACTAG 6CCGTTTGTT 

37201 ACCACCGCTG 6TA0CGGTOG TTTTTTTGTT TGCAAGCAGC AGATTACGCG 
TGGTCGCGAC CATCCCCACC AAAAAAACAA ACGTTC6TCG TCTAATGCGC 

37251 CA6AAAAAAA GGATCTCAAG AAGATCCTTT GATCTTTTCT ACGGGGTCTG 
GTCTTTTTTT CCTAGAGTTC TTCTAGGAAA CTAGAAAAGA TGCCCCAGAC 

37301 ACGCTCAGTG GAACGAAAAC TCACGTTAAG GGATTTTGGT CATGAGATTA 
TGCGAGTCAC CTTGCTTTTG AGTGCAATTC CCTAAAACCA GTACTCTAAT 

37351 -TCAAAAAGGA TCTTCACCTA GATCCTTTTA AATCAATCTA AAGTATATAT 
AGTTTTTCCT AGAAGTGGAT CTAGGAAAAT TTAGTTAGAT TTCATATATA 

37401 GAGTAAACTT GGTCTGACAG TTACCA^TGC TTAATCAGTG AGGCACCTAT 
CTCATTTGAA CCAGACTGTC AATGGTTACG AATTAGTCAC TCCGTGGATA 

37451 CTCAGCGATC TGTCTATTTC GTTCATCCAT AGTTGCCTGA CTCCCCGTCG 
GAGTCGCTAG ACAGATAAAG CAAGTAGGTA TCAACGGACT GAGGGGCAGC 

37501 TGTAGATAAC TACGATACGG GAGGGCTTAC CATCTGGCCC CAGTGCTCCA 
ACATCTATTG ATGCTATGCC CTCCCGAATG GTAGACCGGG GTCACGACGT 

37551 ATGATACCGC GAGACCCACG CTCACCGGCT CCAGATTTAT CAGCAATAAA 
TACTATGGCG CTCTGGGTGC GAGTGGCCGA GGTCTAAATA GTCGTTATTT 

37601 CCAGCCAGCC GGAAGGGCCG AGCGCAGAAG TGGTCCTGCA ACTTTATCCG 
GGTCGGTCGG CCTTCCCGGC TCGCGTCTTC ACCA66ACGT TGAAATAGGC 

37651 CCTCCATCCA GTCTATTAAT TGTTGCCGC»G AAGCTAGAGT AAGTAGTTCG 
GGAGGTAGGT CAGATAATTA ACAACGGCCC TTCGATCTCA TTCATCAAGC 

37701 CCAGTTAATA GTTTGCGCAA CGTTGrTGCC ATTGCTACAG GCATC6TGGT 
GGTCAATTAT CAAACGCGTT QCAACAACGC TAACGATGTC CCTAGCACCA 

37751 GTCACGCTCG TCGTTTGGTA TGGCTTCATT CAGCTCCCCT TCCCAAC6AT 
CAGTGCGAGC AGCAAACCAT ACCGAAGTAA GTCGAGGCCA AGGGTTGCTA 

37801 CAAGGCGAGT TACATGATCC CCCATGfTTGT GCAAAAAAGC GGTTA6CTCC 
6TTCCGCTCA ATGTACTAGG GGGTACAACA CGTTTTTTCG CCAATCGAGG 

37851 TTCGGTCCTC CGATCGTTGT CAGAAGTAAG TTGCCCX5CAG TCTTATCACT 
AA6CCACGAG GCTAGCAACA GTCTTCATTC AACCGGCGTC ACAATAGTGA 
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37901 CATGGTTATG q^pCJMrTGC ATAATTCTCT TACTGTCATG'I&CATC ^ 

GTACCAATAC ^TOGTGACG TATTAAGAGA ATGACAGTAC GGTAGGWR'T 

37951 GATGCTTTTC TGTGACTGGT GAGTACTCAA CCAAGTCATT CTGAGAATAG 
CTACGAAAAG ACACOXSACCA CTCATGAGTT GGTTCAGTAA GACTCTTATC 

38001 TGTAT6CGGC GACCGAGTTG CTCTTGCCCG GCGTCAACAC GGGATAATAC 
ACATACGCCG CTGGCTCAAC GAGAACGGGC CGCAGTTGTG CCCTATTATG 

38051 CGCGCCACAT AGCAGAACTT TAAAAGTGCT CATCATTGGA AAACGTTCTT 
GCGCGGTGTA TCGTCTTGAA ATTTTCACGA GTAGTAACCT TTTGCAAGAA 

38101 CGGGGCGAAA ACTCTCAAGG ATCTTACC6C TGTTGAGATC CAGTTCGATG 
6CCCCGCTTT TGAGAGTTCC TAGAATGGCG ACAACTCTA6 GTCAAGCTAC 

38151 TAACCCACTC GTGCACCCAA CTGATCTTCA GCATCTTTTA CTTTCACCAG 
ATTGGGTGAG CACGTGGGTT GACTAGAAGT CGTAGAAAAT GAAAGTGGTC 

38201 CGTTTCTGG6 TGAGCAAAAA CAGGAAGGCA AAATGCCGCA AAAAAGGGAA 
GCAAAGACCC ACTCGTTTTT GTCCTTCCGT TrTACGGCGT TTTTTCCCTT 

38251 TAAGGGCGAC ACGGAAATGT TGAATACTCA TACTCTTCCT TTTTCAATAT 
ATTCCCGCTG TGCCTTTACA ACTTATGAGT ATGAGAAGGA AAAAGTTATA 

38301 TATTGAAGCA TTTATCAGGG TTATTGTCTC ATGAGCGGAT ACATATTTGA 
ATAACTTCGT AAATAGTCCC AATAACAGAG TACTCGCCTA TGTATAAACT 

38351 ATGTATTTAG AAAAATAAAC AAATAGGGGT TCCGCGCACA TTTCCCCGAA 
TACATAAATC TTTTTATTT6 TTTATCCCCA AGGCGCGTGT AAAGGGGCTT 

38401 AAGTGCCACC TGACGTCTAA GAAACCATTA TTATCATGAC ATTAACCTAT 
TTCACGGTGG ACTGC^^GATT CTTT6GTAAT-AATAGTACTG -TAATTGGATA 

38451 AAAAATAGGC GTATCACGAG GCCCTTTCGT CTTCAAGAAT TOGATCCGAA 
TTTTTATCCG CATAGTGCTC CGGGAAAGCA GAAGTTCTTA ACCTAG6CTT 

Pad 



38501 TTCTTAATTT CTTAATTAA (SBQ ID NO: 32) 
AAGAATTAAA GAATTAATT (SESQ ID NO: 33) 
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MRKAd5TicfMER1063 
(MRKAdS Pre-Adenoviral Vector Containing the G2A,LLA nef Coding Region) 



1 CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TGATAATGA6 
GTAGTAGTTA TTATATGGAA TAAAACCTAA CTTCGGTTAT ACTATTACTC 

51 GGQGTGGAGT TTCTGACGTG GCGCGGGGCG TGGGAACGGG GCCGGTCACG 
CCCCACCTCA AACACTGCAC CGCGCCCCGC ACCCTTGCCC CGCCCACTGC 

101 TAGTAGTGTG GCGGAAGTGT 6ATGTTGCAA GTGTGGCGGA ACACATGTAA 
ATCATCACAC CGCCTTCACA CTACAACGTT CACACCGCCT TGTGTACATT 

151 GCGACGGATG T3GCAAAAGT GACGTnTTG GTGTGCGCCG GTGTACACAG 
CGCTGCCTAC ACCGTTTTCA CTCCAAAAAC CACACGCGGC CACATGTGTC 

201 6AAGTGACAA TTTTCGCGCG GTTTTAGGCG GATGTTGTAG TAAATTTGGG 
CTTCACTGTT AAAAGCGCGC CAAAATCCGC CTACAACATC ATTTAAACCC 

251 CGTAACCGAG TAAGATTTGG CCATTTTCGC GGGAAAACTG AATAA6AGGA 
GCATTGGCTC ATTCTAAACC GGTAAAAGCG CCCTTTTGAC TTATTCTCCT 

301 AGTGAAATCT GAATAATTTT GTGTTflCTCA TAGCGCGTAA TATTTGTCTA 
TCACTTTAGA CTTATTAAAA CACAATGAGT ATCGCGCATT ATAAACAGAT 

351 GGGCCGCGGG 6ACTTTGACC GTTTACGTGG AGACTCGCCC AGGTGTTTTT 
CCCGGCGCCC CTGAAACTGG CAAATGCACC TCTGAGCGGG TCCACAAAAA 

401 CTCAGGTGTT TTCCGCGTTC C6GGTCAAAG TTGGCGTTrT ATTATTATAG 
GAGTCCACAA AA6GCGCAAG GCCCAGTTTC AACCGCAAAA TAATAATATC 

451 GCGGCCGCGA TCCATTGCAT ACGTrGTATC CATATCATAA TATGTACATT 
CGCCGGCGCT AGGTAACGTA TGCAACATAG GTATAGTATT ATACATGTAA 

501 TATATTGGCT CATGTCCAAC ATTACCGCCA TGTTGACATT GATTATTGAC 
ATATAACCGA GTACAGGTTG TAATGGCGGT ACAACTGTAA CTAATAACTG 

551 TAGTTATTAA TAGTAATCAA TTACGG6GTC ATTAGTTCAT AGCCCATATA 
ATCAATAATT ATCATTAGTT AATGCCCCAG TAATCAAGTA TCGGGTATAT 

601 TGGAGTTCCC CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACC6 
ACCTCAAGGC GCAATGTATT GAATGCCATT TACCGGGCGG ACC6ACTGGC 

651 CCCAACGACC CCCQCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT 
GGGTTGCTG6 GGGCGGGTAA CTGCAGTTAT TACTOCyiTAC AAGGGTATCA 

701 AACGCCAATA GGGACTTTCC ATTGACGTCA ATGGGTGGAG TATTTACGGT 
TTGCGGTTAT CCCTGAAAOG TAACTGCAOT TACCCACCTC ATAAATGCCA 

751 AAACTGCCCA CTTGGCAGTA CATCAAGTGT ATCATATGCC AAGTACGCCC 
TTTGACGGGT OAACCGTCAT GTAGTTCACA TAGTATACGG TTCAT6CGGG 

801 CCTATTGACG TCAATGACGG TAAATG6CCC GCCTGGCATT ATGCCCAGTA 
G6ATAACTQC ACTTACTQCC ATTTACCGGG CGGACCGTAA TACGGGTCAT 
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851 



CAT6ACCTTA :||BpACTTTC 

GTACTG6AAT xUcTOfiAAG 



CTACTTGGCA 
GATGAACCGT 



GTACATCTAC GTATT] 
CATGTAGATG CATAA' 




:a 

IT 



901 



TCGCTATTAC CATGGTGATG 



CGGTTTTGGC 



AGTACATCAA TGGGCGTGGA 



AGCGATAATG GTACCACTAC GCCAAftACCG TCATGTAGTT ACCCGCACCT 

951 TAGCGGTTTG ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA 
ATCOCCAAAC T6AGTGCCCC TAAAGGTTCA GAGGTGGGGT AACTGCAGTT 

1001 TGGGAGTTTG TTTTGGCACC AAAATCAACG GGACTTTCCA AAATGTCGTA 
ACCCTCAAAC AAAACCGTGG TTTTAGTTGC CCTGAAAGGT TTTACAGCAT 

1051 ACAACTCCGC CCCATTGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG 
TGTTGAGGCG 6SGTAACTGC GXTTACCTGC CATCCGCACA TGCCACCCTC 

1101 GTCTATATAA GCAGAGCTCG TTTAGTGAAC CGTCAGATCG CCTGGAGACG 
CAGATATATT CGTCTCGAGC AAATCACTTG GCAGTCTAGC GGACCTCTGC 

1151 CCATCCACGC TGTTTTGACC TCCATAGAAG ACACCGGGAC CGATCCAGCC 
GGTAGGTGCG ACAAAACTGG AGGTATCTTC TGTGGCCCTG GCTAGGTCGG 

1201 TCCGCGGCCG GGAACGGTGC ATTGGAACGC GGATTCCCCG TGCCAAGAGT 
AGGCGCCGGC CCTTGCCACG TAACCTTGCG CCTAAGGGGC ACGGTTCTCA 

12 51 GAGATCTGCC ACCATGGCCG GCAAGTGGTC CAAGAGGTCC GTGCCCGGCT 
CTCTAGACGG TGGTACCGGC CGTTCACCAG GTTCTCCAGG CACGGGCCGA 

1301 GGTCCACCGT GAGGGA6AGG ATGAGGAGGG CCGAGCCCGC CGCCGACAGG 
CCAGGTGGCA CTCCCTCTCC TACTCCTCCC GGCTCGGGCG 6CGGCTGTCC 

1351 GTGAG6AGGA CCGAGCCCGC CGCAGTGGGC GTGGGCGCCG TGTCCAGGGA 
— CACTCCTCCT- GGCTCGGGCG GCGTCACCCG-CACCGGGGGC-ACACGTCCCT 

1401 CCT6GAGAAG CACGGCGCCA TCACCTCCTC CAACACCGCC GCCACCAACG 
GGACCTCTTC GTGCCGCGGT AGT6GAG6AG GTTGTG6CGG CGGTGGTT6C 

1451 CCGACTGCGC CTGGCTGGA6 GCCCAGGAGG ACGAGGAGGT GGGCTTCCCC 
GGCTCACGCG GACCGACCTC CGGGTCCTCC TGCTCCTCCA CCCGAAGGGG 

1501 GTGAGGCCCC AGGTGCCCCT GAGGCCCATG ACCTACAAGG GCGCCGTGGA 
CACTCCGGGG TCCACGGGGA CTCCGGGTAC TGGATGTTCC CGCGGCACCT 

1551 CCTGTCCCAC TTCCTGAAGG A6AAGGGCGG CCTGGA6GQC CTGATCCACT 
GGACAGGGTG AA6CACTTCC TCTTCCCGCC GGACCTCCCG GACTAGGTGA 

1601 CCCAGAA6AG GCAGGACATC CTGGACCTGT GGGTGTACCA CACCCAGGGC 
GGGTCTTCTC C6TCCTGTA6 GACCTGGACA CCCACATGGT GTGGGTCCCG ' 

1651 TACTTCCCCG ACTGGCAGAA CTACACCCCC GGCCCCGGCA TCAGGTTCCC 
AT6AAGGGGC TGACCGTCTT GATGTGGGGG CCQGGGCCGT AGTCCAAGGG 

1701 CCTGACCTTC GGCTGGTGCT TCAAGCTGGT GCCCGTGGAG CCCGAGAAGG 
GGACTGGAAG CCSACCACGA AGTTCGACCA CGGGCACCTC GGGCTCTTCC 

1751 TGGAGGAGGC CAACGAGGGC GAGAACAACT GCGCCGCCCA CCCCATGTCC 
ACCTCCTCCG GTTGCTCCCG CTCTT6TTGA CGCGGCGGGT QGGGTACAGG 
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IBOI 
1851 
1901 
1951 
2001 
2051 
2101 
2151 
2201 
2251 
2301 
2351 
2401 
2451 
2501 
2551 
2601 
2651 
2701 



GTCGTGCCGT wScTCCTGGG GCTCTTCCTC CACGACCTCA CCTCCASBCT 

CTCCAAGCTG GCCTTCCACC J^CGTGOCCAG GGAGCTQCAC CCCGAGTACT 
GAGGTTCGAC CX5GAAGGTGG TOCACOGGTC 0CTCGACGT6 GGGCTCATGA 

ACAAGGACTC CTAAACCCCG GGCAGATCTG CTGTCCCTTC TAGTTGCCAC ' 
TGTTCCTGAC GATTTCGGGC CCGTCTAGAC GACACGGAAG ATCAACGGTC 

CCATCTGTTO TTTGCCCCTC CCCCGTCCCT TCCTTGACCC TG6AAGGTGC 
GGTAGACAAC AAACOGGGAG GGGGCACGGA AGGAACTGGG ACCTTCCACG 

CACTCCCACT GTCCTTTCCT AATAAAAT6A GGAAATTGCA TCGCATTGTC 
GTGAGGGTGA CAGGAAAGGA TTATTrTACT CCTTTAACGT AGCGTAACA6 

T6AGTA6GT6 TCATTCTATT CTGGGGGGTG GGGTGGGGCA GGACAGCAAG 
ACTCATCCAC AGTAAGATAA GACCCOCCAC CCCACCCCGT CCT6TCGTTC 

GGGGAGGATT OGGAAGACAA TAGCAGGCAT GCTG6GGATG CGGTGGGCTC 
CCCCTCCTAA CCCTTCTGTT ATCGTOCGTA CGACCCCTAC GCCACCCGAG 

TAT6GCC6AT CX3GCGCGCCG TACT6AAATG TGTGGGCGTG GCTTAAGGGT 
ATACCG6CTA GCCGCGCGGC ATGACTTTAC ACACCCGCAC CGAATTCCCA 

CGGAAAGAAT ATATAAGGTG GGGGTCTTAT GTAGTTTTGT ATCTGTTTTG 
CCCTTTCTTA TATATTCCAC CCCCAGAATA CATCAAAACA TAGACAAAAC 

a^GCAGCCGC CGCCGCCATG AGCACCAACT CGTTTGATGG AAGCATTGTG . 
GTCGTCGGCG QCGGCGGTAC TCGTGGTT6A GCAAACTACC TTCGTAACAC 

A3CTCATATT TGACAACGCG CATGCCCCCA TGGGCCGGGG TGCGTCAGAA 
TCGAGTATAA ACTGTTGCGC GTACGGGGGT ACCCGGCCCC ACGCAGTCTT 

TGTGATGGGC TCCAGCATTG ATGGTCGCCC CGTCCTGCCC GCAAACTCTA 
ACACIACCCG AGGTCGTAAC TACCAGCGGG GCAGGACGGG CGTTTGAGAT 

CTACCTTGAC CTACGAGACC GTGTCTGGAA CGCCGTTGGA GACTGCAGCC 
GATGGAACTG GATGCTCTGG CACAGACCTT GCGGCAACCT CTGACGTCGG 

TCCGCCGCCG CTTCAGCCGC TGCAGCCACC GCCCGCGGGA TTGTGACTGA 
AGGCGGCGGC GAAGTCGGCG ACGTCGGTGG CGGGCGCCCT AACACTGACT 

CTTTGCTTTC CTGAGCCCGC TTGCAAACAG TGCAGCTTCC CGTTCATCCG 
GAAACGAAA6 GACTCGGGCG AACGTTTGTC ACGTCGAAGG GCAA6TAGGC 

CCCGCGATGA CAAGTTGACG GCTCTTTTGG CACAATTGGA TTCTTTGACC 
GGGCGCTACT STTCAACTGC CGAGAAAACC GTGTTAACCT AAGAAACTGG 

CGGGAACTTA ATGTCGTTTC TCAGCAGCTG TTGGATCTCC GCCAGCAGCT 
GCCCTTGAAT TACAGCAAA6 AGTCGTCGAC AACCTAGACG CGGTCGTCCA 

TTCTGCCCT6 AAGGCTTCCT CCCCTCCCAA TGCGGTTTAA AACATAAATA 
AAGACGGGAC TTCCGAA66A GGGGAGG6TT ACGCCAAATT TrGTATTTAT 

AAAAACCAGA CTCTQTTTGO ATTTGGATCA AGCAAGTGTC TTGCTGTCTT 
TTTTTGCTCT GAGACAAACC TAAACCTAGT TCGTTCACAG AACGACAGAA 
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2751 TATTTAGGGG T^WGCGCGC CCGGTAGGCC CGGGACCAGC GCTC3CZ!WC 
ATAftATCCCC AAAACGCGCG CGCCATCCGG QCCCTGGTCG CCAGAGCCAG 

2801 6TTGAGGGTC CTGTGTATTT TTTCCAGGAC GTGGTAAAGG TGACTCTGGA 
CAACTCCCAG GACACATAAA AAAGGTCCTG CACCATTTCC ACTGAGACCT 

2851 TGTTCAGATA CATGGGCATA AGCCC6TCTC TG6GGTGGAG STA6CACCAC 
ACAAGTCTAT CTACCCCTAT TCGGGCAGAG ACCCCACCTC CATCGTGGTG 

2501 TGCAGAGCTT CATGCTGCC6 GCTGGTGTTG TAGATGATCC AGTCGTAGCA 
ACGTCTCGAA GTACGACOCC CCACCACAAC ATCTACTAG6 TCAGCATCGT 

2951 GGAGCGCTGG GCGTGGTGCC TAAAAATGTC TTTCAGTAGC AAGCTGATTG 
CCTCGCGACC CGCACCACGG ATTTTTACAG AAAGTCATCG TTCGACTAAC 

3001 CCAGGGGCAG GCCCTT6GTG TAAGTGTTTA CAAAGCSGTT AAGCTGGGAT 
GGTCCCCGTC CGGGAACCAC ATTCACAAAT GTTTCGCCAA TTCGACCCTA 

3051 GGGTGCATAC GTGGGGATAT 6AGATGCATC TTGGACTGTA TTTTTAGGTr 
CCCACGTATG CACCCCTATA CTCTACGTAG AACCTGACAT AAAAATCCAA 

3101 GGCTATGTTC CCAGCCATAT CCCTCCGGGG ATTCA7GTTG TGCAGAACCA 
CCGATACAAG GGTCGGTATA GGGAGGCCCC TAAGTACAAC ACGTCTTGGT 

3151 CCAGCACAGT GTATCC6GTG CACTTGGGAA ATTTGTCATG TAGCTTAGAA 
GGTCGTGTCA CATAGGCCAC GTGAACCCTT TAAACAGTAC ATCGAATCTT 

3201 GGAAATGCGT G3AAGAACTT GGAGACGCCC TTGTGACCTC CAAGATTTTC 
CCTTTACGCA CCTTCTTGAA CCTCTGCGGG AACACTGGAG GTTCTAAAAG 

"3251 CATOCATTCG~ICCATAATGA~T^ GCGGCCTGGG 
GTACGTAAGC AGGTATTACT ACCGTTACCC GGGTGCCCGC CGCCGGACCC 

3301 CGAACSATATT TCTGGGATCA CTAACGTCAT AGTTGTGTTC CAGGATGAGA 
GCTTCTATAA AGACCCTAGT GATTGCAGTA TCAACACAAG GTCCTACTCT 

3351 TCGTCATAGG CCATTTTTAC AAAGCGCGGG CGGAGGGTGC CAGACTGCGG 
AGCAGTATCC GGTAAAAATG TTTCGCGCCC GCCTCCCACG CTCTGACGCC 

3401 TATAATGGTT CCATCCGGCC CACGCGCGTA GTTACCCTCA CAGATTTGCA 
ATATTACCAA QGTAGGCCGG GTCCCCGCAT CAATGGGAGT GTCTAAACGT 

3451 TTTCCCACGC TTTGAGTTCA GAT6GGGGGA TCAT6TCTAC CTGCGGGGCG 
AAAGGGTOCG AAACTCAAGT CTACCCCCCT AGTACAGATG GACGCCCCGC 

3501 ATGAAGAAAA CX3GTTTCCGG GGTAQGGGAG ATCAGCTGQG AA GAAAG CAG 
TACTTCTTTT GCCAAAGGCC CCATCCCCTC TAGTCGACCC TTCTTTCGTC 

3551 GTTCCTGAGC AGCTGCGACT TACCQCAGCC GGTGGGCCCG TAAATCACAC 
CAAGGACTCG TCGACGCTGA ATG6CGTCGG CCACCCGGGC ATTTAGTGTG 

3601 CTATTACCGG CTGCAACTGG TAGTTAAGA6 ASCTGCA6CT GCXrGTCATCC 
GATAATGGCC GACGTTGACC ATCAATTCTC TCGACGTCGA CQGCAGTAG6 

3651 CTGAGCAGG6 GGGCCACTTC GTTAAGCATG TCCCTGACTC GCATGTTTTC 
GACTCGTCCC CCCGGTGAAG CAATTCGTAC ACCGACTCAC CGTACAAAAG 
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3701 
3751 
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4001 
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GGACTGGTTT AGGCGGTCTT CCGCGAGCGG CGGGTCGCTA TCGTCAAGAA 

GCAAGGAAGC AAAGTTTTTC AACGGTTTGA GACCGTCCGC CGTAGGCATG 
CGTTCCTTCG TTTCAAAAAG TTGCCAAACT CTGGCAGGCG GCATCCGTAC 

CTTTTGAGCG TTTGACCAAG CAGTTCCAGG CGGTCCCACA GCTCGGTCAC 
GAAAACTCGC AAACTGGTTC GTCAA6GTCC GCCAGGGTGT CGAGCCAGTG 

CTGCTCTACG GCATCTCGAT CCAGCATATC TCCTCGTTTC 6CGGGTTGGG 
GACGAGATGC CGTAGAGCTA GGTCGTATA6 AGGAGCAAA.G CGCCCAACCC 

GCGGCTTTCG CTGTACGGCA GTA6TCGGTG CTCGTCCAGA CGQ6CCAGG6 
CGCCGAAAGC GACATGCCGT CATCAGCCAC GA6CAGGTCT 6CCCGGTCCC 

TCATGTCTTT CCACGCCCCC ACCCTCCTCC TCACCCTAGT CTGCGTCACG 
AGTACAGAAA GGTGCCCGCG TCCCAOGAGC A9TCGCATCA GACCCAGMC 

6TGAAGGGGT GCGCTCC6GG CTGCGCCCTG GCCA6GGTGC GCTTGAGGCT 
CACTTCCCCA CGCGAGGCCC GAC6CGCCAC CGGTCCCAC6 CGAACTCCGA 

G6TCCTGCTG GT6CTGAA6C GCT6CCG6TC TTCGCCCTGC 6CGTCGGCCA 
CCAGGACGAC CACGACTTC6 CGACGGCCAG AAGCGGGACG CGCAGCCGGT 

GGTAGCATTT GACCAT6GTG TCATAGTCCA 0CCCC7CCGC GGCGTGGCCC 
CCATCGTAAA CTGGTACCAC ACTATCACGT CGGGGAGGC6 CCGCACCGGG 

TTGGCGCGCA GCTTGCCCTT GGAGGAGGCG CCGCACGAGG GGCAGTGCAG 
AACCGCGCGT CGAACGGGAA CCTCCTCCGC GGCGTGCTCC CCGTCACGTC 

ACTTTTGAGG GC6TAGAGCT TGGGCGCGAG AAATACCGAT TCCGGGGAGT 
TGAAAACTCC CGCATCTCGA ACCCGCGCTC TTTATGGCTA ACGCCCCTCA 

AGGCATCCGC GCCGCAGGCC CCGCAGACGG TCTCGCATTC CACGAGCCAG 
TCCGTAGGCG CG6CGTCCGG 6GCGTCTGCC AGAGCGTAAG GTGCTCGGTC 

GTGAGCTCT6 GCCGTTCGGG GTCAAAAACC AGGTTTCCCC CATGCTTTTT 
CACTCGAGAC CGGCAAGCCC CAGTTTTTOG TCCAAAGGGG GTACGAAAAA 

GATGCGTTTC TTACCTCTGG TTTCCATGAG CCGGTGTCCA CGCTCGGTGA 
CTACGCAAAG AATGGAGACC AAAGGTACTC GGCCACAGGT GCGAGCCACT 

CGAAAAGGCr GTCCGTGTCC CCGTATACAG ACTTGAGAGG CCTGTCCTCG 
GCTTTTCCGA CAGGCACAGG GGCATATGTC TGAACTCTCC GGACAGGAGC 

AGCGGTGTTC CGCGGTCCTC CTCGTATAGA AACTCGGACC ACTCTGAGAC 
TCGCCACAAG GCGCCAGGAG GAGCA^ATCT TTGAGCCTGG TGAGACTCTG 

AAA6GCTCGC GTCCAGGCCA GCACX3AAGGA GGCTAAGTGG GAGGGGTAGC 
TTTCCGAGCG CAGGTCCGGT CGTGCTTCCT CCGATTCACC CTCCCCATCG 

GGTCGTTGTC CACTAGGGGG TCCACTCGCT CCAGGGTGTG AAGACACATG 
CCAGCAACAG GTGATCCCCC A6GTGAGCGA GGTCCCACAC TTCTGTGTAC 

TCGCCCTCTT CGGCATCAAG GAAGGTGATT GGTTTGTAGG TGTAGGCCAC 
AGCGGGAGAA GCCGTAGTTC CTTCCACTAA CCAAACATCC ACATCCGGTG 
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4651 GTGACCGGGT j^pbCTGMG GGGGGCTATA AAAG666GTG GGGGCj||PrT 

CACTGGCCCA CAAGGACTTC CCCCCGATAT TTTCCCCCAC CCCCGCGCAA 

m 

4701 CGTCCTCACT CTCTTCCGCA TCGCTGTCT6 CGAGGGCCAG CTGTTGGGGT 
GCAGGAGTGA GAGAAGGCGT AGCGACAGAC GCTCCCGGTC GACAACCCCA 

4751 GAGTACTCCC TCTGAAAAGC GGGCATGACT TCTGCGCTAA GATTGTCAGT 
CTCATGAGGG AGACTTTTCG CCCGTACTGA AGACGCGATT CTAACAGTCA 

4801 TTCCAAAAAC GAGGAGGATT TGATAT7CAC CT6GCCCGCG GTGAT6CCTT 
AAGGTTTTTG CTCCTCCTAA ACTATAAGTG GACCGGGCGC CACTACGGAA 

4851 TGAGGGTGGC CGCATCCATC TGGTCAGAAA AGACAATCTT TTTGTT6TCA 
ACTCCCACCG GCGTA6GTAG ACCAGfTCTTT TCTGTTAGAA AAACAACAGT 

4901 AGCTTGGTGG CAAACGACCC GTAGAGGGCG TTG6ACAGCA ACTTGGCGAT 
TCGAACCACC GTTTGCTGGG CATCTCCCGC AACCTGTCGT TGAACCOCTA 

4951 GGAGCGCAGG GTTTGGTTTT TGTCQCGATC GGCGCGCTOC TTGGCCGCGA 
CCTCGCGTCC CAAACCAAAA ACAGCGCTAG CCGCGC6AGG AACCGGCGCT 

5001 . TGTTTA6CTG CACGTATTCG C6CGCAACGC ACCGCCATTC GGGAAAGACG 
ACAAATCGAC GTGCATAAGC GCGCGTTGCG TGGCGGTAAG CCCTTTCTGC 

5051 GTGGTGCGCT CGTCGGGCAC CAGGTGCACG CGCCAACCGC GGTTGTGCAG 
CACCACGC6A GCAGCCCGTG GTCCACGTGC GCGGTTGGCG CCAACACGTC 

5101 GGTGACAAGG TCAACGCTGG TGGCTACCTC TCCGCGTAGG CGCTCGTTGG 
CCACTGTTCC AGTTGCGACC ACCGATGGAG AGGCGCATCC GCGAGCAACC 

5151 TCCAGCAGA G GCGGCCGCCC TTGCGCGAGC AGAATGG CG G TAGG GGGTCT 
AGGTCGTCTC CGCCGGCGGG AACGCGCTCG TCTTACCGCC ATCCCCCAGA 

5201 AGCTGCGTCT CGTCCGGGGG GTCTGCGTCC ACGGTAAAGA CCCCGGGCAG 
TCGACGCAGA GCAGGCCCCC CAGACGCAGG TGCCATTTCT GGGGCCCGTC 

5251 CAGGCGCGCG TCGAAGTAGT CTATCTTGCA TCCTTGCAAG TCTAGCGCCT 
GTCCGCGCGC AGCTTCATCA GATAGAACGT AGGAACGTTC AGATCGCGGA 

5301 GCTGCCATGC GCGGGCGGCA AGCGC6CGCT CGTATGGGTT GAGTGGGGGA 
CGACGGTACG CGCCCGCCGT TCGCGCGCGA GCATACCCAA CTCACCCCCT 

5351 CCCCATGGCA TGGGGTGGGT GAQCGCGGAG GCGTACATGC CGCAAATGTC 
GGGGTACCGT ACCCCACCCA CTCGC6CCTC CGCATGTACG GCGTTTACAG 

5401 GTAAACGTAG A6GGGCTCTC TGAGTATTCC AAGATATGTA GGGTAGCATC 
CATTTGCATC TCCCCGAGAG ACTCATAAGG TTCTATACAT CCCATCGTAG 

5451 TTCCACCGCG GATGCTGGC6 CGCAC6TAAT 06TATAGTTC GTGCGAGGGA 
AAGGTGGCGC CTACGACCGC eCGTGCATTA GCATATCAAG CACGCTCCCT 

5501 6CGAGGAGGT CGGGACCGAG GTTOCTACGG GCGGGCTGCT CTGCTCGGAA 
CGCTCCTCCA 6C0CTGGCTC CAACGATQCC CGCCCGACGA GACGAGCCTT 

5551 GACTATCTGC CTCAAGATGG CATGTGAGTT GGATGATATG GTTGGACGCT 
CTGATAGACG GACTTCTACC GTACACTCAA CCTACTATAC CAACCTGCGA 




101/144 



wo 02/22080 



PCT/USOl/28861 



5601 
565X 
5701 
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GGAAGACGTT G^kcTQGCG TCTGTGAGAC CTACCGCGTC ACGCAGM^G 
CCTTCTGCAA cWfcGACCGC A6ACACTCTG GATGGCGCAG TGCGTdWx: 

GAGGCGTAGG ASTCGCOCAG CTTGTTGACC AGCTCGGCGG TGACCT6CAC 
CTCCGCATCC TCAGCCCOTC GAACAACTGG TC6AGCCGCC ACTGGACGT6 

GTCTAGGGCG CAGTAGTCCA GGGTTTCCTT GATGATGTCA TACTTATCCT 
CAGATCCCGC GTCATCAG6T CCCAAAG6AA CTACTACAGT AT6AATAGGA 

GTCCCTTTTT TTTCCACAGC TCGCGGTTGA GGACAAACTC TTCGCGGTCT 
CAGGGAAAAA AAAGGTGTCG AGCGCCAACT CCTGTTTGAG AAGCGCCAGA 

TTCCAGTACT CTTGGATCGG AAACCCGTCG GCCTCCGAAC GGTAAGAGCC 
AAGGTCATGA GAACCTAGCC 7ITGGGCAGC CGGAGGCTT6 CCATTCTCGG 

TAGCATGTAG AACTGGTTGA CGGCCTC3GTA GGCGCAGCAT CCCTTTTCTA 
ATCGTACATC TTGACCAACT OCCGGACCAT CCGCGTCGTA GGGAAAAGAT 

CGG6TAGCGC GTATGCCTGC GCGGCCTTCC GGAGCGAGGT GTGGGTGAGC 
GCCCATCGCG CATACGGACG CGCCGGAAGG CCTCGCTCCA CACCCACTCG 

GCAAAGGTGT CXTCTGACCAT GACTTTGAGG TACTGGTATT TGAAGTCAGT 
CGTTTCCACA GGGACTGGTA CTGAAACTCC AT6ACCATAA ACTTCAGTCA 

GTCGTCGCAT CCGCCCTGCT CCCAGAGCAA AAAGTCCGTG CGCTTTTTGG 
CAGCAGCGTA GGCGGGACGA GGGTCTCGTT TTTCAGGCAC GCGAAAAACC 

AACGCGGATT TGGCAGGGCG AAGGTGACAT CGTTGAAGAG TATCTTTCCC 
TTGCGCCTAA ACCGTCCCGC TTCCACT6TA GCAACTTCTC ATAGAAAGGG 

GCGCGAGGCA TAAAGTTGCG TGTGATGCGC AAGGGTCCCG GCACCTCGGA 
CGCGCTCCGT ATTTCAACGC ACACTACGCC TTCCCAGGGC CGTGGAGCCT 

ACGGTTGTTA ATTACCTGGG CGGCGAGCAC GATCTCGTCA AAGCCGTTGA 
TGCCAACAAT TAATGGACCC GCCGCTCGTG CTAGAGCAGT TTCGGCAACT 

TGTTGTGGCC CACAATGTAA AGTTCCAAGA AGCGCGGGAT GCCCTTGATG 
ACAACACCGG GTGTTACATT TCAAGGTTCT TCGCGCCCTA CGGGAACTAC 

GAAGGCAATT TTTTAAGTTC CTCGTAGGTG AGCTCTTCAG GGGAGCTGAG 
CTTCCGTTAA AAAATTCAAG GACCATCCAC TCGAGAAGTC CCXTTCGACTC 

CCCGTGCTCT GAAAGGGCCC AGTCTGCAAG ATGAGGGTTG GAAGCGAC6A 
GGGCACGAGA CTTTCCCGGG TCAGACGTTC TACTCCCAAC CTTCGCTGCT 

ATGAGCTCCA CA6GTCACGG GCCATTAGCA TTTGCAGGTG GTCGCGAAAG 
TACTCGAGGT GTCCAQTGCC CGGTAATCGT AAACGTCCAC CACCGCTTTC 

GTCCTAAACT GGCGACCTAT GGCCATTTTT TCTGOCGTGA TGCAGTAGAA 
CAGGATTTGA CCGCTGGATA CCGGTAAAAA A6ACCCCACT ACGTCATCTT 

GGTAAGCGGG TCTTGTTCCC AGCGGTCCCA TCCAAGGTTC GCGGCTAGGT 
CCATTCGCCC AGAACAAGGG TCGCCAGGGT AGGTTCCAAG CGCCGATCCA 

CTCGCGCGGC AGTCACTAGA GGCTCATCTC CX3CCGAACTT CATGACCAGC 
GAGCCCGCCG TCAGTGATCT CCGAGTAGAG GCGGCTTGAA GTACTGGTC6 
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6551 ATGAAGGGCA .cAsCTGCTT CCCAAAGGCC CCCATCCAAG TATAC<3»C 
TACTTCCCGT offf&GACGAA GGGTTTCCGG GGGTAGGTTC ATATCCJBRG 

6601 TACATCGTAG GTGACAAAGA GACGCTCGGT GCGAGGATGC GAGCCGATCG 
ATGTAGCATC CACTGTTTCT CTGCGAGCCA CGCTCCTACG CTCGGCTA6C 

6651 GGAAGAACTG GATCTCCXGC CACCAATTGG AGGAGTGGCT ATTGATGTGG 
CCTTCTTGAC CTAGAGGGCG GTGGTTAACC TCCTCACCGA TAACTACACC 

6701 TGAAAGTAGA AGTCCCTGCG ACGGGCCGAA CACTCGTGCT GGCTTTTGTA 
ACTTTCATCT TCACGGACGC TOCCCGGCTT GTGAGCACGA CCGAAAACAT 

6751 AAAACGTGCG CAGTACTGGC AGCGGTGCAC GGGCTGTACA TCCTGCACGA 
TTTTGCACGC GTCATGACCG TCGCCACGT6 CCCGACATGT AGGAC6TGCT 

6801 GGTTGACCTG ACGACCGCGC ACAAGGAAGC AGAGTGGGAA TTTGAGCCCC 
CCAACTGGAC TGCTGGCGCG TGTTCCTTCG TCTCACCCTT AAACTCCCGG 

6851 TCGCCTGGCG GGTTTOGCT6 GTGGTCTTCT ACTTCGGCTG CTTGTCCTTG 
AGCGGACCGC CCAAACCGAC CACCAGAAGA TGAAGCCGAC GAACAGGAAC 

6901 ACC6TCTGGC TGCTCGAGGG GAGTTACGGT GGATCGGACC ACCACGCCGC 
TGGCAGACCG ACGAGCTCCC CTCAATGCCA CCTAGCC7C-G TGGTGCGGCG 

6951 GCGAGCCCAA AGTCCAGATG TCCGCGCGCG GCGGTCGGAG CTTGATGACA 
CGCTCGGGTT TCAGGTCTAC AGGCGCGCGC CGCCAGCCTC GAACTACTGT 

7001 ACATCGCGCA GATGGGAGCT GTCCATGGTC TGGAGCTCCC GCGGCGTCAG 
TGTAGC6C6T CTACCCTCGA CAGGTACCAG ACCTCGAGGG CGCCGCAGTC 

7051 GTCAGGCG6G AGCTCCTGCA GGTTTACCTC GCATAGACGG GTCAGGGCGC 
CACTCCGCCC~TCGAG»C^ CCAAATGGAG CGTATCTGCC CAGtCCCGCG 

7101 GGGCTAGATC CAGGTGATAC CTAATTTCCA GGGGCTGGTT GGTGGCGGCG 
CCCGATCTAG GTCCACTATG GATTAAAGGT CCCCGACCAA CCACCGCCQC 

7151 TCGATGGCTT GCAAGAGGCC GCATCCCCGC GGCGCGACTA CGGTACCGCG 
AGCTACCGAA CGTTCTCCGG CGTAGGGGCG CCGCGCTGAT GCCATGGCGC 

7201 CGGCCGGCGG TGGGCCGCGG GGGTGTCCTT 6GATGATGCA TCTAAAAGCG 
GCCGCCCGCC ACCCGGCGCC CCCACAG6AA CCTACTACGT AGATTTTCGC 

7251 GTGACGCGGG CGAGCCCCCG GAGGTA6GG6 6GGCTCCGGA CCCGCCGGGA 
CACTGCGCCC GCTCGGGGGC CTCCATCCCC CCCGAGOCCT GQGCGGCCCT 

7301 GAGGGGGCAG GGGCACGTCG GCGCCGC6CG CGGGCAGGAG CTGGTGCTQC 
CTCCCCCGTC CCCGTGCA6C CGCGGCGCGC OCCCGTCCTC GACCACGACG 

7351 GCGCGTAGGT TGCTGGCGAA CGCGACGACG CGGCOGTTGA TCTCCTGAAT 
CGCGCATCCA ACGACGGCTT CCGCTGCT6C GCCGCCAACT AGAGGACTTA 

7401 CT66CGCCTC TGCGTGAAGA CGACOGGCCC GGTGAGCTTG AACCTGAAAG 
GACCGCGGAG ACGCACTTCT OCTGCCCOGG CCACTCGAAC TTOGACTTTC 

7451 AGAGTTCGAC AGAATCAATT TCGGTGTCGT TGACGGC6GC CTGGCGCAAA 
TCTCAAGCTG TCTTAGTTAA AGCCACAGCA ACTGCCGCCG GACC6CGTTT 
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ATCTCCTCCA O^TTCTGA GTTGTCTTGA TAGGCG;m^ QGGCmmLfc' 
TAGAGGACGT dHEAGGACT CAACAG^ACT ATCCGCTAGA GCCGGTlVrT 

CTGCTCGATC TCTTCCTCCT GGAGATCTCC GCGTCCGGCT CGCTCCACGG 
GACGAGCTAG AGAAGGAG6A CCTCTAGAGG CGCAGGCCGA GCGAGGTGCC 

TGGCGGCGAG GTCGTTGGAA ATGCGGGCCA TGAGCTGCGA GAAGGCGTTG 
ACCGCC6CTC CACCAACCTT TACCCCCGGT ACTCGACGCT CTTCCGCAAC 

AGGCCTCCCT CGTTCCAGAC GCGGCTCTAG ACCACGCCCC CTTCGGCATC 
TCCG6A66GA GCAAGGTCTG CGCCGACATC TGGTGCGGG6 GAAGCC6TA6 

GCGGGCGCGC ATGACCACCT GCGCGAGATT GAGCTCCACG TGCCGGGCGA 
CGCCCGCGCG TACT6GTGGA CGC6CTCTAA CTCGAGGTGC AC6GCCCGCT 

AGACGGCGTA GTTTCGCAGG CGCTGAAAGA GGTAGTTGAG GGTGGTGGCG 
TCTGCCOCAT CAAAGCGTCC GCGACTOTCT CCATCAACTC CCACCACCGC 

GTGTGTTCTG CCACGAAGAA GTACATAACC CAGCGTCGCA ACGTGGATTC 
CACACAAGAC GGTGCTTCTT CATGTATTGG GTCGCAGCGT TGCACCTAAG 

GTTGATATCC CCCAAGGCCT CAAGGCGCTC CATGGCCTCG TAGAAGTCCA 
CAACTATAGG GGGTTCCGGA GTTCCGCGAG GTACCGGAGC ATCTTCAGGT 

CGGCGAAGTT 6AAAAACTGG GAGTTGCGCG CCGACACGGT TAACTCCTCC 
GCCGCTTCAA CTTTTTGACC CTCAACGCGC GGCTGTGCCA ATTGAGGAGG 

TCCAGAAGAC GGATGAGCTC GGCGACAGTG TCGCGCACCT CGCGCTCAAA 
AGGTCTTCTG CCTACTCGAG CCGCTGTCAC AGCGCGTGGA GCGCGAGTTT 

GGCTACAGGG GCCTCTTCTT CTTCTTCAAT CTCCTCTTCC ATAAGGGCCT 
CCGATGTCCC CX3GAGAAGAA GAAGAACTTA CACGAGAAGG TATTCCCGGA 

CCCCTTCTTC TTCTTCTGGC GGCGGTGGGG GAGGGGGGAC ACGGCGGCGA 
GGGGAAGAAG AAGAAGACCG CCGCCACCCC CTCCCCCCTG TGCCGCCGCT 

CGACGGCGCA CCGGGAGGCG GTCGACAAAG CGCTCGATCA TCTCCCCGCG 
6CTGCCGCGT OGCCCTCCGC CAGCTGTTTC GC6AGCTAGT AGAGGGGCGG 

GCGACGQCGC ATGGTCTC6G TGACGGCGCG GCCGTTCTCG CGGGGGCGCA 
CGCTGCCGCG TACCAGAGCC ACTGOCGCGC CGGCAAQAGC 0CCCCC6CGT 

GTTGSAAGAC GCCGCCCGTC AT6TCCCGGT TATGGGTTGG CGGGGG6CTG 
CAACCTTCTG CX^GCGGGCAG TACAGGGCCA ATACCCAACC GCCCCCCGAC 

CCATGC6GCA G6GATACGGC GCTAACGATG CATCTCAACA ATTGTTGT6T 
GGTAC6CCGT CCCTATGCCG C6ATTGCTAC GTAGAGTTGT TAACAACACA 

AC3GTACTCCG CCGCCGAGGG ACCTGAGCGA GTCCGCATCG ACOGGATCGG 
TCCAT6AGGC GGC6GCTCCC TGGACTCGCT CAGGCGTAGC TGOCCTAGCC 

AAAACCTCTC GAGAAAGGCG TCTAACCAGT CACAGTCGCA AGGTAGGCT6 
TTTTGGAGAG CTCTTTCCGC AGATTGGTCA GTGTCAGCGT TCCATCCGAC 

AGCACCGTG6 CGGGCGGCAG CGGGCGGCGG TCGGGGTTGT TTCTGGCGGA 
1CGTG6CACC 6CCCGCCGTC GCCCGCC6CC A6CCCCAACA AAGACCGCCT 
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8451 6GTGCTGCT6 I^^TGTJJiT TAAAGTAGGC G6TCTTGAGA CGGCGCt^G 
CCACGACGAC flB^ACATTA ATTTCATCCG CCAGAACTCT GCCGCCWTCC 

8501 TCGACAiSAAG CACCATGTCC TTGGGSTCCGG CCTGCTGAAT GC6CAGGCGG 
AGCTGTCTTC GTGGTACAG6 AACCCAOGCC GGACGACTTA CGCGTCCGCC 

8551 TCCCCCATGC CCCAGGCTTC GTTTTGACAT CGGCGCAGGT CTTTGTAGTA 
AGCCGGTACG GGGTCCGAA6 CAAAACTGTA GCC6CGTCCA GAAACATCAT 

8601 QTCTTGCATG AGCCTTTCTA CCGGCACTTC TTCTTCTCCT TCCTCTTGTC 
CAGAACGTAC TCGGAAAGAT GGCCGXGAAG AA6AAGAGGA AGGAGAACA6 

8651 CTGCATCTCT TOCATCTATC GCTGCGGCGG CG6CGGAGTT TGGCCGTAGG 
GACGTAGAGA ACGTAGATAG CGACGCCGCC GCCGCCTCAA ACCGGCATCC 

8701 TGGCGCCCTC nCCTCCCAT GCGTGTGACC CCGAAGCCCC TCATCGGCTG 
ACCGCGGGA6 AAGGAGGGTA CGCACACTGG GGCTTCGGGG AGTAGCCGAC 

8751 AAGCAGGGCT AGGTCGGCGA CAACGCGCTC GGCTAATATG GCCTGCTGCA 
TTCGTCCCGA TCCAGCCGCT GTTGCGCGAG CCGATTATAC CGGACGACGT 

8801 CCTGCGTGAG GGTAGACTGG AAGTCATCCA TGTCCACAAA GCGGTGGTAT 
GGAC6CACTC CCATCTGACC TTCAGTAGGT ACAGGTGTTT CGCCACCATA 

8851 GCGCCCGTGT TGATGGTGTA AGTGCAGTTG GCCATAACGG ACCAGTTAAC 
CGCGGGCACA ACTACCACAT TCACGTCAAC CGGTATTGCC TGGTCAATTG 

8901 GGTCTGGTGA CCCGGCTGCG AGAGCTCGGT GTACCTGAGA CGCGAGTAAG 
CCAGACCACT GGGCCGACGC TCTCGAGCCA CATGGACTCT GCGCTCATTC 

8951 CCCTCGAGTC AAATACGTAG TCGTTGCAAG TCCGCACCAG GTACTGCTAT 
GGGAGCTCAG TTTATGCATC "AGCAACGTTC AGGCGTGGTC "C ATGACCATA 

9001 CCCACCAAAA AGTGCGGCGG CGGCTG6CGG TAGAGGGGCC AGCGTAGGGT 
GGGTGGTTTT TCACGCCGCC GCCGACCGCC ATCTCCCCGG TCGCATCCCA 

9051 GGCCGGGGCT CCGGGG6CGA GATCTTCCAA CATAAGGCGA TGATATCCGT 
CCGGCCCCGA GGCCCCCGCT CTAGAAG6TT GTATTCCGCT ACTATAGGCA 

9101 AGAT6TACCT GGACATCCAG GTGATGCCGG CGGCG6TGGT GGAGGCGCGC 
TCTACATGGA CCTGTAGGTC CACTACG6CC GCCGCCACCA CCTCCGCGCG 

9151 GGAAAGTCGC GGACGCGGTT CCA6ATGTTG CGCAGC6GCA AAAAGTGCTC 
CCTTTCAGCG CCTGCGCCAA G6TC1ACAAC GC6TCGCCGT TTTTCACGAG 

9201 CATGGTCGGG ACGCTCTGGC CQ6TCAG6CG CGCGCAATCG TTGACGCTCT 
6TACCAGCCC TGCGAGACCG GCCAGTCCGC GCGC6TTAGC AACTGCGAGA 

9251 AGACCGTGCA AAAGGAGAGC CTGTAAGCGG GCACTCTTCC GTGGTCTGGT 
TCTGGCACGT TTTCCTCTCG 6ACATTCGCC CGTGAGAAGG CACCAGACCA 

9301 GGATAAATTC GCAAGGGTAT CATGGCGGAC GACCGGGGTT CGAGCCCCGT 
CCTATTTAAG COWCCCATA GTACCGCCTG CTGGCCCCAA GCTCGGGGCA 

9351 ATCCGGCCGT CCGCCGTGAT CCAT6CGGTT ACCGCCCGCG TGTCGAACCC 
TAGCCCGCCA GGCGGCACTA G6TACGCCAA TOGCGGGCGC ACAGCTTGGG 
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5401 AGGTGTGCGA cAaGACM CCCCCCACTG CTCCTTTTGG CTTCCT^fcA 
TCCACACGCT G^S&TCTGTT GCCCCCTCAC GAGGAAAACC GAAGGASBST 

9451 GGCGCGGCGG CTGCTGCGCT AGCTTTTTTG GCCACTOCCC 5CGCGCAGCG 
CCGCGCCGCC 6ACGACGCGA TCGAAAAAAC CGGTGACCGG CGCGCGTCGC 

9501 TAAGCGGTTA GGCTGGAAAG CGAAAG-IIATT AAGTGGCTCC CTCCCTGTAG 
ATTCGCCAAT CCGACCTTTC 6CTTTCGTAA TTCACCGAGC GAGGGACATC 

9551 CCGGAGGGTT ATTTTCCAAG GGTTGAGTCG CGGGACCCCC GGTTCGAGTC 
GGCCTCCCAA TAAAAGGTTC CCAACTCAGC GCCCTGGGGG CXAAGCTCAG 

9601 TCGGACCGGC CGGACTGCGG C6AAC6GGGG TTTGCCTCCC CGTCATGCAA 
AGCCTGGCCG 6CCTGACGCC GCTT6CCCCC AAAC6GAGGG 6CAGTACGTT 

9651 GACCCCGCTT GCAAATTCCT CCG6AAACAG GGACGAGCCC CTTTTTTGCT 
CTGGGGCGAA CGTTTAA6GA GGCCTTTGTC CCTGCTCGGG GAAAAAAC6A 

9701 TTTCCCAGAT GCATCCGGTG CTGCGGCAGA TGCGCCCCCC TCCTCAGCAG 
AAAGGGTCTA CGTAGGCCAC GACGCCGTCT ACGCGGGGGG AGGAGTCGTC 

9751 CGGCAAGAGC A^GAGCAGCG GCAGACATGC AGGGCACCCT CCCCTCCTCC 
GCCGTTCTCG TTCTCGTCGC CGTCT6TACG TCCCGTGSGA GGGGAGGAGG 

98 01 TACCGCGTCA GGAGGGGCCA CATCCGCGGT TGACGCGGCA GCAGATGGTG 
ATGGCGCAGT CCTCCCCGCT 6TAGGC3CCA ACTGCGCCGT CGTCTACCAC 

9851 ATTACGAftCC CCCGCGGCGC CGGGCCCGGC ACTACCTGGA CTTGGAGGAG 
TAATGCTTCG GGGCGCCGCG GCCCGGGCCG TGATGGACCT GAACCTCCTC 

9901 GGCGAGGGCC TGGCGCGGCT AGGAGCGCCC TCTCCTGAGC GGCACCCAAG 
CCGCTCCCGG AwCGCGCCGA TCCTCGCGGG AGAGGACTCG CCGTGQGTTC 

9951 GGTGCAGCTG AAGCGTGATA CGCGTGAGGC GTACGTGCCG CGGCAGAACC 
CCACGTCGAC TTCGCACTAT GCGCACTCCG CATGCACGGC GCCGTCTTG6 

10001 TGTTTCGCGA CCGCGAGGGA GAGGAGCCCG AGGAGATGCG GGATCGAAAG 
ACAAA6CGCT GGCGCTCCCT CTCCTCGGGC TCCTCTACGC CCTAGCTTTC 

10051 TTCCACGCAG GSCGCGAGCT GCGGCATGGC CTGAATCGCG AGCGGTTGCT 
AAGGTGCGTC CCGCGCTCGA CGCCGTACCG GACTTAGCGC TCGCCAACGA 

10101 GCGCGAGGAG GACTTTGAGC CCGACGCGCQ AACCGGGATT AGTCCCGCGC 
CGCGCTCCTC CTGAAACTCG GGCTGCGCGC TTGGCCCTAA TCAGGGCGCG 

10151 GCGCACACGT 6GCGGCCGCC GACCTGGTAA CCGCATACGA GCAGACGGTG 
CGCGTGTGCA CCGCCGGCGG CTGGACCATT GGCGTATGCT CGTCTGCCAC 

10201 AACCAGGAGA TTAACTTTCA AAAAAGCTTT AACAACCACG TGC6TACGCT 
TTGGTCCTCT AATT6AAAGT TTTTTOGAAA TTGTTQGTGC ACQCATGCGA 

10251 TGTGGCGCGC GAGGAGGTOG CTATAGGACT GATGCATCT6 TC66ACTTTG 
ACACCGCGCG CTCCTCCACC GATATOCTGA CTACGTAGAC ACCCTGAAAC 

10301 TAAGCGCGCT GGAGCAAAAC CCAAATAGCA AGCCGCTCAT GGCGCAGCTG 
ATTCGCCCGA CCTCGTTTTG GGrTTATCGr TCGGCGAGTA CCGCGTCGAC 
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10351 
10401 
10451 
10501 
10551 
10601 
10651 
10*701 
10751 
10801 
10851 
10901 
10951 
11001 
11051 
11101 
11151 
11201 
11251 



TTCCTTATA6 T^hf3CACA6 CA6GGACAAC GAGGCATTCA GGGATGlBiT 
AAGGAATATC ASSFCGTGTC GTCCCTGTTG CTCCGTAAGT CCCTAC^SSGA 

GCTAAACATA GTAGAGCCCG AG6GCCGCTG GCTGCTCGAT TTGATAAACA 
CGATTTGTAT CATCTCOGGC TCCCGGCGAC C6ACGAGCTA AACTATTTGT 

TCCTGCAGAG CATAGTGGTG CAGGAGCGCA GCTT6AGCCT G5CTGACAAG 
AGGACGTCTC GTATCACCAC GTCCTCGCGT CGAACTCGGA CCGACTGTTC 

GTGGCCGCCA TCAACTATTC CATGCTTAGC CTGGGCAAGT TTTACGCCCG 
CACCGGCGGT AGTTGATAAG GTACGAATCG GACCCGTTCA AAATGCG6GC 

CAAGATATAC CATACCCCTT ACGTTCCCAT AGACAAGGAG GTAAAGATCG 
GTTCTATATG GTATGGGGAA TGCAAGGGTA TCTGTTCCTC CATTTCTAGC 

AGGGGTTCTA CATGCGCATG GCGCTGAAGG TGCTTACCTT GAGCGACGAC 
TCCCCAAGAT GTACGCGTAC CGCGACTTCC ACGAATGGAA CTCGCTGCTG 

CTGGGCGTTT ATCGCAACGA GCGCATCCAC AAGGCCGTGA GCGTGAGCCG 
GACCCGCAAA TAGCGTTGCT CGCGTA65TG TTCCGGCACT CGCACTCGGC 

GCGGCGC6AG CTCAGCGACC GCGAGCTGAT GCACAGCCTG CAAAGGGCCC 
CGCCGCGCTC GAGTCGCTGG CGCTCGACIA CGTGTCGGAC 3TTTCCCGGG 

TGGCTGGCAC GGGCAGCGGC GATAGAGAGG CCGAGTCCTA CTTTGACGCG 
ACCGACCGTG CCCGTCGCCG CTATCTCTCC GGCTCAGGAT GAAACTGCGC 

GGCGCTGACC TGCGCTGGGC CCCAAGCCGA CGCGCCCTGG AGGCAGCTGG 
CCGCGACTGG ACGCGACCCG GGGTTCGGCT GCGCGGGACC TCCGTCGACC 

GGCCGGACCT GGGCTGGCGG TGGCACCCGC GCGCCCT6GC AAC6TCGGCG 
XCGGCCTGGA CCCGACCGCC ACCGTGGGC6-CGCGC6ACCG TTOCAGCGGC 

GCGTGGAGGA ATATGACGAG 6AC6AT3AGT ACGAGCCAGA GGACGGCGAG 
CGCACCTCCT TATACTGCTC CTGCTACTCA TGCTCGGTCT CCT6CC6CTC 

TACTAAGCGG TGATGTTTCT GATCAGATGA TGCAAGACGC AACGGACCCG 
ATGATTCGCC ACTACAAAGA CTAGTCTACT ACGTTCTGC6 TTGCCTGGGC 

GCGGTGCGGG C9GCGCTGCA GAGCCAGCC6 TCCGGCCTTA ACTCCAC6GA 
CGCCACGCCC GCCGCGACGT CTCCCTCGGC AGGCCGGAAT TGAGGTGCCT 

CGACTGGCGC CAGGTCATGG ACCGCATCAT GTCGCTGACT GCGCGCAATC 
GCTGACCGCG GTCCAGTACC TGGCGTAGTA CAGCGACTGA CGCGCGTTAG 

CT6ACGCGTT CCGGCAGCA6 CCGCAQGCCA ACCGGCTCTC CGCAATTCTG 
GACTGCGCAA GC3CCGTC6TC GGCGTOCGGT TQQCCGAGAG CC6TTAAGAC 

GAAGCGGTGG TCCCGGCGCG CGCAAACCCC ACGCACGAGA AGGTGCTGGC 
CTTCGCCACC AGG6CCGCGC GCGTTTGGGG TGCGTGCTCT TCCACGACCG 

GATCGTAAAC OCGCTGGCCG AAAACAGGGC CATCCGGCCC GACGAGGCCG 
CTAGCATTTG CGCGACCGGC TTTTGTCCCG GTAGGCCGGG CTGCTCCGGC 

GCCTGGTCTA CGACGCGCTG CTTCAGCGCG TGGCTCGTTA CAACAGCGGC 
CGGACCAGAT GCTGCQC6AC GAAGTCGCGC ACCGAGCAAT GTTGTCGCCG 




107/144 



wo 02/22080 



PCT/USOl/28861 



11301 

11*3 5 X 

11401 

11451 

11501 

11551 

11601 

11651 

11701 

11751 

11801 

11851 

11901 

11951 

12001 

12051 

12101 

12151 

12201 



AACGTGCAGA C^hrCTGGA CCGGCTGGTG GGGGATGTGC' iSCGAasP^T 
TTGCACGTCT G3f5'GGACCT GGCCGACCAC CCCCTACACG CGCTCCSBtA 

GGCGCAGCGT GAGCGCGCGC AGCAGCAGG6 CAACCTGGGC TCCATGGTTG 
CCGCGTCGCA CTCGCGCGCG TCGTCGTCCC GTTGGACCCG AGGTACCAAC 

CACTAAACGC CTTCCTGAGT ACACAGCCCG CCAACGTGCC GCGGGGACAG 
GTGATTTGCG GAAGGACTCA TGTGTC3GGC GGTTGCACGG CGCCCCTGTC 

GAGGACTACA CCAACTTTGT GAGC6CACTG CGGCTAATGG TSACTGAGAC 
CTCCTGATGT OSTTGAAACA CTCGCGTGAC GCCGATTACC ACTGACTCTG 

ACCGCAAAGT GAGGTGTACC AGTCTGGGCC AGACTATTTT TTCCAGACCA 
TGGCX5TTTCA CTCCACATGG TCAGACCCG6 TCTGATAAAA AAGGTCT66T 

GTAGACAAGG CCTGCAGACC GTAAACCT6A GCCAGGCTTT CAAAAACTTG 
CATCTGTTCC GGACGTCTGG CATTTGGACT CGGTCCGAAA GTTTTTGAAC 



CAGGGGCTGT GGOGGGTGCG G0CTCCCACA GGCGACCGCG CGACCGTGTC 
GTCCCCGACA CCCCCCACGC CCGA6QGTGT CCGCTGGCGC GCTGGCACAG 

TAGCTTGCT6 ACGCCCAACT CGCGCCTGTT GCTGCTGCTA ATAGCGCCCT 
ATC6AACGAC TCCGGGTTGA GCGC6GACAA CGACGACGAT TATCGCGGGA 

TCAC6GACA0 TGOCAGCGTG TCCCGGGACA CATACCTAGG TCACTTSCTG 
AGTGCCTGTC ACC6TCGCAC AGGGCCCTGT GTATGGATCC AGTGAACGAC 

ACACTGTACC GCGAGGCCAT AGGTCAGGCG CATGT6GACG A6CATACTTT 
TGTGACATCG CGCTCCGGTA TCCAGTCCGC GTACACCTGC TCGTATGAAA 

CCAGGAGATT ACAAGTGTCA GCCGCGCGCT GGGGCAGGAG GACACGGGCA 
GGTCCTCTAA TGTTCACAGT CGGCGCGCGA CCCCGTCCTC CTGTGCCCGT 

GCCTGGAGGC AACCCTAAAC TACCTGCTGA CCAACCGGCG GCAGAAGATC 
CGGACCTCCG TT6GGATTTG ATGGACGACT GGTTGGCCGC CGTCTTCTAG 

CCCTCGTTGC ACAGTTTAAA CAGCGAGGAG GAGCGCATTT TGCGCTACGT 
GGGAGCAACG TGTCAAATTT GTCGCTCCTC CTCGCGTAAA ACGCGATGCA 

GCAGCAGAGC GTGACCCTTA ACCTGATGCG CGACGGGGTA ACGCCCAGCG 
CGTCGTCTCG CACTCGGAAT TGGACTACGC GCTGCCCCAT TGCGGGTCGC 

TGGCGCTGGA CATGACCGCG CGCAACATGG AACCGGGCAT GTATGCCTCA 
ACCGCGACCT GTACTGGCGC GCGTTGTACC TTGGCCCGTA CATACGGAGT 

AACCGGCCGT TTATCAACCG CCTAATCGAC TACTTGCATC GCGCGGCCGC 
TTGGCCGGCA AATAGTTGGC GGATTACCTG ATGAACGTAG CGCGCCGGCG 

CGTGAACCCC GAGTATTTCA CCAATGCCAT CTTGAACCCG CACTGGCTAC 
GCACTTGGGG CTCATAAAGT GGTTACGGTA GAACTTGGGC GTGACCGATG 

CGCCCCCTGG TTTCTACACC CGGGGATTC6 AGGTGCCCGA QGGTAACGAT 
GCGGGGGACC AAAGATGTGG CCCXTTAAGC TCCACGGGCT CCCATTGCTA 

GGATTCCTCT GGGACGACAT AGACGACAGC GTGTTTTCCC CGCAACCGCA 
CCTAAGGAGA CCCTGCTGTA TCTGCTCTCC CACAAAAGG6 GCGTTQGCGT 
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12251 GACCCTGCTA Gi«rTGCAAC AGCGCGAGCA GGCAGASCJCG^GCT^aafiA 
CTGGGACGAT cIHaCGTTG TCGCGCTCGT CCGTCTCCGC CGCGAdV!^ 

12301 AGGAAA6CTT CCGCAGGCCA AGCA6CTTGT CCGATCTAGG CX3CTGCG6CC 
TCCTTTCGAA GGCGTCC6GT TCGTCGAACA GGCTAGATCC 6CGACGCCGG 

12351 CCGCGGTCAG ATGCTAGTAG CCCATTTCCA AGCTTGATAG GGTCTCTTAC 
GGCGCCAGTC TACGATCATC GGGTAAAGGT TC6AACTATC CCAGAGAATG 

12401 CAGCACTCGC ACCACCCGCC CGCGCCTGCT GGGCGAGGAG GAGTACCTAA 
GTCGTGAGCG TGGTGGGCGG GCGCGGACGA CCCGCTCCTC CTCATGGATT 

12451 ACAACTCGCT GCTGCAGCCG CAGCGC6AAA AAAACCTGCC TCCGGCATTT 
TGTTGAGCGA CGACGTCGGC GTCGCGCTTT TTTTG6ACGG AGGCCGTAAA 

12501 CCCAACAACG 6GATAGAGAG CCTAGTGGAC AA6ATGAGTA GATGGAAGAC 
GGGTTGTTGC CCTATCTCTC 6GATCACCTG TTCTACTCAT CTACCTTCTG 

12551 GTACGCGCAG GAGCACAGGG ACGTGCCAGG CCCGCGCCCG CCCACCCGTC 
CATGCGCGTC CTCGTGTCCC TGCACGGTCC GGGCGCGGGC GGGTGGGCAG 

12601 GTCAAAGGCA CGACC6TCA6 CGGGGTCTGG TGTGGGAGGA CGATGACTCG 
CAGTTTCCGT 6CTGGCAGTC 6CCCCAGACC ACACCCTCCT GCTACTGAGC 

12651 6CAGACGACA 6CAGC6TCCT GGATTTOGGA GGGA6TGGCA ACCCGTTTGC 
CGTCTGCTGT CGTCGCAGGA CCTAAACCCT CCCTCACCGT TGGGCAAACG 

12701 GCACCTTCGC CCCAGSCTGG GGA6AATGTT TTAAAAAAAA AAAAAGCATG 
CGTGGAAGCG &3GTCCGACC CCTCTTACAA AATTTTTTTT TTTXTCGTAC 

12751 ATGCAAAATA AAAAACTCAC CAAGGCCATG GCACCGAGCG TTGGTTTTCT 
— TACGTTTTAT-TPT-TTiSACTG GTTCCG3TAC -CGTGGCTCGC -AACCAAAAGA - 

12801 TGTATTCCCC TTAGTATGCG GCGCGCG6CG ATGTATSAGG AA6GTCCTCC 
ACATAAGGGG AATCATACGC CGCGCGCCGC TACATACTCC 7TCCAGGAGG 

12851 TCCCTCCTAC GAGA6TGTGG TGAGCGCGGC GCCAGTGGCG GCGGCGCTGG 
AGGGAG6ATG CTCTCACACC ACTCGCGCCG CGGTCACCGC CGCCGCGACC 

12901 GTTCTCCCTT CGATGCTCCC CTGGACCCGC CGTTTGTGCC TCCGCGGTAC 
CAAGAGGGAA GCTACGA66G GACCTGC3GCG GCAAACACGG . AGGCGCCATG 

12951 CTGCGGCCTA CCGGGGGGAG AAACAGCATC CGTTACTCTG AGTTGGCACC 
GACGCCGGAT GGCCCCCCTC TTTG7CGTAG GCAATGAGAC TCAACCGT6G 

13001 CCTATTCGAC ACCACCCGTG TGTACCTGGT GGACAACAAG TCAACGGATG 
GGATAAGCTG TGGTGGGCAC ACATG6ACCA CCTGTTGTTC AGTTGCCTAC 

13051 TGGCATCCCT GAACTACCAG AACGACCACA GCAACTTTCT GACCACGGTC 
ACCGTAGGGA CT7GATG6TC TTGCTGGTGT CGTTGAAAGA CTGGTGCCAG 

13101 ATTCAAAACA ATGACTACAG CCCGGGGGAG GCAAGCACAC AGACCATCAA, 
TAAGTTTTGT TACTGATGTC GGGCCCCCTC CGTTCGTGTG TCTGGTAGTT 

13151 TCTTGACGAC CGGTCGCACT GGGGCGGCGA CCTGAAAACC ATCCTGCATA 
AGAACTGCTG GCCAGCGTGA CCCCGCCGCT GGACTTTTGG TAGGACGTAT 
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13201 CCAACATGCC AMK^TGAAC GAGTTCATGT TTACCAATAA "^UlTARfiifiG 
GQTTGTACGG rtBlCACTTG CTCAAGTACA AATCCTTATT CAAATT^PC 

13251 CGGGTGATGG TGTCGCGCTT GCCTACTAAG GACAATCAGG TGGAGCTGAA 
C3CCCACTACC ACAGCGCGAA CGGArCATTC CtGTTAGTCC ACCTCGACTT 

13301 ATACGAGTGG GTGGAGTTCA CGCTGCCCGA GGGCAACTAC ?CCGAGACCA 
TATGCTCACC CACCTCAA6T GCGACGOGCT CCCGTTGATG AGGCTCTGGT 

13351 TGACCATAGA CCTTATGAAC AACGCGATCG TCGAGCACTA CTTGAAAGTG 
ACTGGTATCT GGAATACTTG TTGCGCTAGC ACCTCGTGAT GAACTTTCAC 

13401 GGCAGACAGA ACGGGGTTCT GGAAAGCGAC ATCGGGGTAA AGTTTGACAC 
CCGTCTGTCT TCCCCCAAGA CCTTTCGCTG TAGCCCCATT TCAAACTGTG 

13451 CCGCAACTTC AGACTGGGGT TTGACCCCGT CACTGGTCTT GTCATGCCTG 
GGCGTTGAAG TCTOACCCCA AACTGGGGCA GTGACCAGAA CAGTACGGAC 

13501 GGGTATATAC AAACGAAGCC TTCCATCCAG ACATCATTTT 6CTGCCAGGA 
CCCATATATG TOTGCTTCG6 AAGGTAGGTC TGTAGTAAAA CGACGGTCCT 

13551 TGCGGGGTGG ACTTCACCCA CAGCCGCCTG AGCAACTTGT TGGGCATCCG 
ACGCCCCACC TGAAGTGGGT GTCGGCGGftC TCGTTGftACA ACCCGTAGGC 

13601 CAAGCGGCAA CCCTTCCAGG AGGGCTTTAG GATCACCTAC GATGATCTOG 
GTTCGCCQTT GGGAAG6TCC TCCCGAAATC CTAGTGGATC CTACTAGACC 

13651 AGGGTGGTAA CATTCCCGCA CTGTTGGATG TGGACGCCTA CCAGGCGAGC 
TCCCACCATT GTAAGGGCGT GACAACCTAC ACCTGCGGAT GGTCCGCTCG 

13701 TTGAAAGATG ACACCGAACA GGGCGGGGGT GGCGCAGGCG GCAGCAACAG 
AACTTTCTAC TGTGGCTTGT CCCGCCCCCA CCGCGTCCGC CGTC6TTGTC 

13751 CAGTGGCAGC 6GCQC6GAAG AGAACTCCAA CGCGGCAGCC SCGGCAATGC 
GTCACCGTCG CCGCGCCTTC TCTTGAGGTT GCGCCGTCGG CGCCGTTACG 

13801 AGCCGGTGGA GGACATGAAC GATCATGCCA TTCGCG6CGA CACCTTTGCC 
TCGGCCACCT CCTGTACTTG CTAGTACGGT AAGCGCC6CT GTGGAAACGG 

13851 ACACGGGCTG AGGAGAAGCG CGCTGAGGCC GAAGCAGCGG CCGAAGCTGC 
TGTGCCCGAC TCCTCTTCGC GCGACTCCGG CTTCGTCGCC GGCTTCGACG 

13901 CGCCCCCGCT GCGCAACCCG AGGTCGAGAA GCCTCAGAAG AAACCGGTGA 
GCGGGGGCGA CGCGTTGGGC TCCAGCTCTT CGGAGTCTTC TTTGGCCACT 

13951 TCAAACCCCT 6ACA6AGGAC AGCAAGAAAC GCAGTTACAA CCTAATAAGC 
ASTTTGGGGA CTGTCTCCT6 TCGTTCTTTG CGTCAATGTT GGATTATTC6 

14001 AATGACAGCyi CCTTCACCCA GTACCGCAGC TOGTACXTTTG CATACAACTA 
TTACTGTCGT GGRAGTGGGT CATGGCGTCG ACCATGGAAC GTATGTTGAT 

14051 CGGCGACCCT CAGACCGGAA TCCGCTCATG GACCCTGCTT TGCACTCCTG 
GCCGCTGQGA GTCTGGCCTT AGGCGAGTAC CTGGGACGAA ACGTGAG6AC 

14101 ACGTAACCTG CGGCTCGGAG CAGGTCTACT GGTCGTTGCC AGACATGATG 
TCCATTCGAC 6CCGAGCCTC GTCCAGATGA CCAGCAACGG TCTGTACTAC 
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14X51 
14201 
14251 
14301 
14351 
14401 
14451 
14501 
14551 
14601 
14651 
14701 
14751 
14BC1 
14B51 
14901 
14951 
15001 
15051 



CAAGACCCCG Ts^^^nTCCG tTCCACGCGC CASATCAGCA ACTTTCflBMT 
GTTCTGGGSC AdKGAAGGC GAGGrTGCGCG GTCTAGTCGT TGAAAG<»5Ca 

GGTGGGCGCC GAGCTGTTGC CCGTGCACTC CAAGAGCTTC TACAACGACC 
CCACCCGCGG CTCGACAACG 6GCACGTGAG GTTCTC6AAG ArGTTGCTGG 

AGGCCGTCTA CTCCCAACTC ATCCGCCAGT TTACCTCTCT 6ACCCACGT6 
TCCGGCAGAT GAGGGTTGAG TAGOCGGTCA AATGGAGAiGA CT0GGT6CAC 

TTCAATCGCT rrCCCGAGAA CCAGATTTTG GCOCGCCCGC CAGCCCCCAC 
AAGTTA6C6A AAG6GCTCTT GGTCTAAAAC CGC6CG60CG GTCGG6GGTG 

CATCACCACC GTCAGTGAAA ACGTTCCTGC TCTCACAGAT O^GGGACGC 
GTAGTGGTG6 CAGTCACTTT TGCAAGGACG AGAGTGTCTA 6TGCCCTGCG 

TACCGCTGCC CAACAGCATC GGAGGAGTCC ACCCACTGAC CATTACT6AC 
ATGGCGACGC GTTGTCGTAG CCTCCTCAQQ TCGCTCACTG GTAATGACTG 

GCCAGACGCC GCACCTGCCC CTACGTTTAC AAGGCCCTGG GCATAGTCTC 
CGGTCTGCGG CGTGGACGGG GATGCAAATC TTCCGGGACC CGTATCAGAG 

6CCGCGCGTC CTATO3AGCC GCACTTTTTG AGCAAGCATG TCCATCCTTA 
CGGCGCGCAG 6ATAGCTCGG CGT6AAAAAC TCGTTCGTAC AGGTAGGAAT 

TATCGCCCAG CAATAACACA GQCTG09QCC TGCGCTTCCC AAGCAAGATG 
ATAGCGGGTC GTTATTGTGT CCGACCCCGG ACGCGAAGGG ITCGTTCTAC 

TTTGGCGGGG CCAAGAAGCG CTCCGACCAA CACCCAGTGC QCGTGCGCGG 
AAACCGCCCC GGTTCTTCGC GAGGCTGGTT QTGGGTCACG CGCACGCGCC 

GCACTACCGC GCGCCCTGGG GCGCGCACAA ACGCGGCCGC ACTGGGCGCA 
CGTGATGGCG CGCGGGACCC CGCGCGrGTT TGCGCCGGCC TCACCCCCGT 

CCACCGTCGA TGACGCCATC GACGCGGTGG TGGAGGAGGC QCGCAACTAC 
GGTGGCAGCT ACTGCGGTAG CTGCGCCACC ACCTCCTCCG CGCOTTSATG 

ACGCCCACGC CGCCACCAGT GTCCACAGTG GACGCGGCCA TTCAGACCGT 
TGCGGGTGCG GCGGTGGTCA CAGGTGTCAC CTGCGCCGGT AAGTCTGGCA 

GGTQCGCGGA GCCCGOCGCT ATGCTAAAAT GAAGAGACG6 CCGAGGCGCC 
CCACGCGCCT CGGGCCGCGA TACGATTITA CTTCTCTGCC GCCTCCGCGC 

TAGCACGTC6 CCACCGCCGC CGACCCG3CA CTGCCGCCCA ACGCGCGGCG 
ATCGfTGCAGC GGTQGC6GCG GCTGG6C2GT GACQGCGGGT TGCGCGCCGC 

GCGGCCCTGC TCAACCGCGC ACGTCGCACC GGCCGAC6G6 CGGCCATGCG 
CGCCGGGACG AATTGGCGCG TGCAGCGXGG CCGGCTGCCC GCCGGTACGC 

GGCCGCTC6A A6GCTGGCCG CGGGTATTGT CACTGTGCCC CCCAGGTCCA 
CCGGCGAGCT TCCGACCGCC GCCCATAACA GTGACACGGG GGGTCCAGGT 

GGCGACGAGC GGCCGCCGCA GCAGCCGCGG CCATTAGT6C lATQACTCAG 
CCGCTGCTCG CCCQCX^GCOT CGTCG6CGCC GGTAATCACG ATACTGAGTC 

GGTCGCAOGG GCAACGTGTA TTGGGTGC6C GACTCGGTTA GCG6CCTGC6 
CCAGCGTCCC CGTTGCACAT AACCCACGCG CTGAGCCAAT CGCCGGACGC 
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15101 
15151 
15201 
15251 
15301 
15351 
15401 
15451 
15501 
15551 
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15551 
15701 
15751 
15801 
15851 
15901 
15951 
16001 



0CAC6GGCAC gSH^ggcgg ggggcgcott gatctaacgt tcttttRBv 

ACTTA6?^CTC GTACTGTTGT ATOTATCCXG C66C6GCGGC GCGCAACGAA 
TGAATCTGAG CATGACAACA TACATAGGTC GCCGCCGCCG CGCGTTGCTT 

GCTATGTCCA AOCGCAAAAT CAAA6AAGA6 ATGCTCCAGG TCATCGCGCC 
C6ATACAGGT TCOCGTTTTA GTTTCTTCTC TACGAGGTCC AGTAGCGCGG 

GGAGATCTAT GGCCCCCCGA AGAAGGAAGA GCAGGATTAC AAGCCCCGAA 
CCTCTAGATA CCGGGGGGCT TCTTCCTTCT CGTQCTAATG TTC3GGGCTT 

AGCTAAAGCG GGTCAAAAAG AAAAAGAAAG ATGATGATGA TGAACTTGAC 
TCGATTTCGC CCAGTTTTTC TTTTTCT?TC TACTACTACT ACTTGAACTG 

GACGAGGTGG AACTGCTGCA CGCTACCGCG CCCAOGCGAC GGGTACAGTG 
CTGCTCCACC TTGACGACGT GC6ATGGCGC GGGTCCGCTG CCCATGTCAC 

6AAAGGTCGA CGCGTAAAAC 6T6TTTTGCG ACCC6GCACC ACCGTAGTCT 
CTTTCCAGCT GCGCATTTTG CACAAAACGC TGGGCCGTG6 T6GCA7CAGA 

TTACGCCCGG roAGCGCTCC ACCCGCACCT ACAAGCGCGT GTATGATGAG 
AATGCGGGCC AC7C0CGAG6 TG6GCGTG6A TGTTCGCGCA CATACTACTC 

CTGTACGCCG ACGAGGACCT GCTTGAGCAG GCCAACGAGC GCCTCGGGGA 
CACAT6CCGC TGCTCCTGGA CGAACTCGTC CGGTTGCTCG CCCAGCCCCT 

GTTTGCCTAC GGAAAGCGGC ATAAGGACAT GCTG6CGTTG CCGCTGGACG 
CAAACGGATG CCTTrCGCCG TATTCCTGTA CGACCGCAAC GGCGACCTGC 

AGGGCAACCC AACACCTAGC CTAAAGCCCG TAACACTGCA 6CAGGTGCTG 
TCCCGTTGGG TTGTGGATCG GATTTCGGGC ATTGTGACGT C3TCCACGAC 

CCCGCGCTTQ CACCGTCCGA AGAAAAGCGC GGCCTAAAGC GCGAGTCTOC 
6GGCGCGAAC GTGGCAOGCT TCTTTTCGCG CCGGATTTCG CGCTCAGACC 

TGACTTGGCA CCCACCGTGC AGCTGATCGT ACCCAAGCGC CAGCGACTQG 
ACTGT^CCGT GGGTGGCACO TCGACTACCA T6GGTTCGCG GTCGCTGACC 

AA6ATGTCTT GGAAAAAATG ACCGTGGAAC CTGGGCTGGA GCCCGAGGTC 
TTCTACAGAA CCTTTTTOAC TCCCACCTTC CACCCCACCT C3GGCTCCAG 

CGCGTGCGGC CAATCAAGCA GGTGGCGCCG GGACTGGGCG T3CAGACCGT 
GCGCACGCCG GTTAGTTCGT CCACCGCGGC CCTGACCCGC A2GTCTGGCA 

GGACGTTCAG ATACCCACTA CCAGTAGCAC CAGTATTGCC ACCGCXIACAG 
CCTGCAAGTC 1A7GGGTGAT GGTCATCGTG GTCATAACGG TGGCGGTGTC 

AGGGCATGGA GACACAAACG TCCCCCGTTC CCTCAGCGGT GGCGGATGCC 
TCCCXSTACCT CTGTGTTTGC AG6Q0CCAAC OQAGTCGCCA CCGCCTAC6G 

QCGGTGCAG6 CGGTC0CT6C GGCCGC6TCC AAGACCTCTA CGGAGGTGCA 
CGCCAC6TCC GCCAGCGAC6 CCGGCGCAG6 TTCTGGAGAT GOCTCCACGT 

AACGGACCCG TOGATGTTTC 6CCTTTCAGC CCCCCGGCGC CCGCGCCGTT 
TTGCCTGGGC ACCTACAAAG CGCAAAGTCG GG6GGCCGC6 GGCGCGGCAA 




112/144 



wo 02/22080 



PCT/USOl/28861 



16051 
16101 
16151 
16201 
16251 
16301 
16351 
16401 
16451 
16501 
— 16551 
16601 
16651 
16701 
16751 
16801 
16851 
16901 
16951 



CGAGGAAGTA caPTGCCGCC AGCGCGCTAC TGCCCGAATA TGCCCTJl^T 
6CTC.CTTCAT GCCGCGGCGG TCGCGCGATG ACGGGCTTAT ACGGGATGTA 

CCTTCCATTG CGCCTACCCC CGGCTATCGT GGCTACACCT ACCGCCCCAG 
GGAAGGTAAC GCGGATGGGG GCCGATAGCA CCGATGTGGA TGGCGGGGTC 

AAGACGAGCA ACTACOCGAC GCCGAACCAC CACTGGAACC CGCCGCCGCC 
TTCTGCTCGT TGA7GGGCTG CGGCTTGGTG GTGACCTTGG GCGGCGGCGG 

GTCGCCGTCG CCAGCCCGTG CTGGCCCCGA TTTCCGTGCG CAGGGTGGCT 
CAGCGGCAGC GGTCGGGCAC GACCGGGGCT AAAGGCACGC GTCCCACCGA 

CGCGAAGGAG GCAGGACCCT GGTGCTGCCA ACAGCGCGCT ACCACCCCAG 
GCGCTTCCTC CGTCCTGGGA CCACGACGGT T6TCGCGCGA TGGTGGGGTC 

CATCGTTTAA AAGCCGGTCT TTGTGGTTCT TGCAGATATG 6CCCTCACCT 
GTAGCAAATT TTCGGCCAGA AACACCAAGA ACGTCTATAC CGQGAGTGGA 

GCCGCCTCC6 TTTCCCGGTG CCG6GATTCC GAGGAAGAAT GCACC6TAGG 
CGGCGGAGGC AAAGQGCCAC GGCCCTAAGG CTCCTTCTTA CGTGGCATCC 

AGGGGCAltSG CCGGCCACGG CCTGACGGGC GGCATGCGTC GTGCGCACCA 
TCCCCGTACC GGCCGGTGCC GGACTGCCCG CCGTACGCAG CACGC6TGGT 

GCGGCGGCGG CGCGCGTCGC ACCGTCGCAT GCGCGGCGGT ATCCTGCCCC 
GGCCGCCGCC GC6CGCAGCG TGGCAGCGTA CGCGCCGCCA TAGGACGGGG 

TCCTTATTCC ACTGATCGCC GCGGCGATTG GCGCCGTGCC CGGAATTGCA 
AGGAATAAG6 TGACTAGCGG C6CCGCTAAC CGCGGCACGG 6CCTTAACGT 

TGCGTGGCCT- TGCAOOCGCA^ GAGACACTGA-TTAAAAACAA_6TTGCATGTG 
AGCCACCGGA ACGrcCGCGT CTCTGTGACT AATTTTTGTT CAACGTACAC 

GAAAAATCAA AATAAAAAGT CTGGACTCTC ACGCTCGCTT GGTCCTGTAA 
CTTTTTAGTr TTATTTTTCA GACCTGA6A6 TGCGAGCGAA CCAGGACATT 

CTATTTTGTA GAATGGAAGA CATCAACTTT GCGTCTCTGG CCCCGCGACA 
GATAAAACAT CTTACCTTCT 6TAGTTGAAA CGCAGAGACC GGGGCGCTGT 

CGGCTCGCGC CC6TTCATGG GAAACTGGCA AGATATCGGC ACCAGCAATA 
GCCGAGCGCG GGCAAGTACC CTTTGACCGT TCTATAGCCG TGGTCGTTAT 

T6A6CGGTGG C6CCTTCAGC TGGGGCT'::GC TGTGGAGCGG CATTAAAAAT 
ACTCGCCACC GCGGAAGTCG ACCCCGAQCG ACACCTCGCC GTAATTTTTA 

TTC6GTTCCA CCGTTAAGAA CTATGGCAGC AAGGCCTGGA ACAGCAGCAC 
AAGCCAAGGT G6CAATTCTT GATACCGTCG TTCCGGACCT TGTCGTCGTG 

AGGCCAGATG CTGAGGGATA AGTTGAAAGA GCAAAATTTC CAACAAAAGG 
TCCGGTCTAC GACTCCCTAT TCAACTTTCT CGTTTTAAAG GTTGTTTTCC 

TGGTAGATGG CCTG6CCTCT GGCATTAGCG GGGTGGTGGA CCTGGCCAAC 
ACCATCTACC GGACC6GAGA CCGTAATCGC CCCACCACCT GGACCGGTTG 

CAGGCAGTGC AAAATAAGAT TAACAGTAAG CTTGATCCCC GCCCTCCCGT 
GTCCGTCACG TTITATTCTA ATTGTCATTC GAACTAG6GG CGGGAGGGCA 
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17001 
17051 
171D1 
17151 
17201 
17251 
17301 
17351 
17401 
17451 
17501 
17551 
17601 
17S51 
17701 
17751 
17801 
17851 
17901 



AGAGGAGCCT Q^CGGCCG ' TGGAdACAGT GTCTCCAGAG GGGCGTMP 
TCTCCTCGGA g8RgCCGGC ACCTCTGTCA CAGAGGTCTC CCCGCaSWc 

AAAA6CGTCC GCGCCCCGAC AGGGAAGAAA CTCTGGTGAC GCAAATAGAC 
TTTTCGCAG6 CGCGGGGCT6 TCCCTTCTTT GA6ACCACTG CGTTTATCTS 

GAGCCTCCCT CGTACGAGGA G6CACTA&A6 CAAGGCCTQC CCACCACCCG 
CTCGGAGGGA QCATGCTCCT CCGTGRr?TC GTTCCGGACG GGTGGTGGGC 

TCCCATCGCG CCCATGGCTA CCGGAGTGCT GGGCCAGCAC ACACCCGTAA 
A6GGTAGCGC OGGTACCGAT GGCCTCACGA CCCGGTCGT6 T6TG6GCATT 

CGCTGGACCT GCCTCCCCCC GCCGACACCC AGCAGAAACC TGTGCTGCCA 
GCGACCTGGA CGGAGGGGGG CGGCTGTG6G 7CGTCTTTGG ACACGACGGT 

6GCCCGACCG CCGTTGTTGT AACCCGTCCT AGCCGCGCGT CCCTGCGCCG 
CCGGGCTGGC GGCAACAACA TTGGGCAGGA TCGGCGCGCA 6G6ACGCGGC 

CGCCGCCAGC QGTCCGCGAT CGTTGCGGCC CGTAGCCAGT GGCAACTGGC 
GCGGCGGTCG CXIAGGCGCTA GCAACGCC6G GCATCGGTCA CCGTTGACCG 

AAA6CACACT GAACA6CATC GTGGGTC7GG GGGTGCAATC CCTGAAGCGC 
TTTCGTGTGA CTTGrCGTAG CACCCAGACC CCCACGTTAG GGACTTCGCG 

CGACGATGCT TCTGATAGCT AACGTGTCGT ATGTGTGTCA TCTATCCCTC 
GCTGCTACGA AGACTATCGA TTGCACA6CA TACACACAGT ACATACGCAG 

CATGTCGCCG CCAGAGGAGC TGCTGAGCCG CCGCGCGCCC GCTTTCCAAG 
GTACAGCGGC GGTCTCCTCG ACGACTCGGC GGCGCGCGGG CGAAAGGTTC 

ATGGCTACCC CTTCGATGAT GCCGCAGTGG TCTTACATGC ACATCTCGGG 
TACCCATGCC GAACCTACTA CCCCCTCACC AGAATGTACG T3TAGAGCCC 

CCAGGACGCC TCGGAGTACC TGAGCCCCGG GCTGGTGCAG TrXGCCCGCG 
GGTCCTGCG6 AGCCTCATGG ACTCGGGGCC CGACCACGTC AAACGGGCGC 

CCACCGAGAC GTACTTCAGC CTGAATAACA AGTTTAGAAA CCCCACGGTG 
GGTGGCTCTG CAT6AAGTCG GACTTATT6T TCAAATCTTT 63GGT6CCAC 

GCGCCTACGC ACCACGTGAC CACAGACCCC TCCCAGCGTT TSACGCTGCG 
CGCGGATGCG TOCTGCACTG GTGTCTQGCC AGGGTCGCAA ACTGC6ACGC 

GTTCATCCCT GTGGACCGTG AGGATACTGC GTACTCGTAC AAGGCGCGGT 
CAAGTAGG6A CACCTGGCAC TCCTATGACG CATGAGCATG TTCCGC6CCA 

TCACCCTAGC TG7GGGTGAT AACCGTGTGC TGGACATGGC TTCCACGTAC 
AGTGGGATCG ACACCCACTA TTG6CACACG ACCTGTACCG AAGGTGCAT6 

TTTGACATCC GCGGCGTGCT GGACA0G93C CCTACTTTTA AGCCCTACTC 
AAACTGTAGG CGCCGCACGA CCTGTCCCCG GGATGAAAAT TCGGGATGAG 

T6GCACTGCC TACAACGCCC TGGCTCCCAA GGGTGCCCCA AATCCTTGCG 
ACCGTGACGG A7GTTGCGGG ACCGAGGGTT CCCACGGGGT TXAGGAACGC 

AATGGGATGA AGCTGCTACT GCTCTTQAAA TAAACCTAGA AGAA6AGGAC 
TTACCCTACT TCGACGATGA CGAGAACTTT ATTTGGATCT TCTTCTCCTG 
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17951 
18001 
18051 
3 8101 

leisi 

18201 
18251 
18301 
18351 
18401 
18.451 
18501 
18551 
18601 
18651 
18701 
18751 
18801 
18851 



GATGACAACG PJ^^g^GhhGT AGACGAGCAA GCTQAGCAGC AMAfA^^ 
CTACTGTTGC TTCTGCTTCA TCTGCTCGTT CGACTC6TCG TTTTTTGAGT 

CGTATTTGGG CAGGCGCCTT ATTCTGGTAT AAATATTACA AAGGAGGGTA 
GCATAAACCC GTCCGCGGAA TAAGACCATA TTTATAATCT TTCCTCCCAT 

TTCAAATAGG TGTCGAA6GT CAAACACCTA AATATGCCGA TAAAACATTT 
AAGTTTATCC ACAGCTTCCA GTTTGTGGAT TTATACGGCT ATTTTGTAAA 

CAACCTGAAC CTCAAATAGG AGAT^TCAG TGGTACGAAA CAGAAATTAA 
GTTGGACTTG GAGTTTATCC TCTTAGA6TC ACCATGCTTT 6TCTTTAATT 

TCATGCAGCT GGGAGAGTCC TAAAAAAGAC TACCCCAATG AAACCATGTT 
AGTACGTCGA CCCTCTCAGG ATTTTTTCT6 ATGGGGTTAC TTTGGTACyUl 

ACGGTTCATA TGCAAAACCC ACAAATGAAA ATGGAGGGCA AQGCATTCTT 
TGCCAAGTAT ACGTTTTGGG TCTTTACTTT TACCTCCCGT TCCGTAAGAA 

GTAAAGCAAC AAAAT6GAAA GCTAGAAAGT CAAGTGGAAA TGCAATTTTT 
CATTTCGTTG TTTTACCTTT CGATCTTTCA 6TTCACCTTT ACGTTAAAAA 

CTCAACTACT 6AGGCAGCCG CAGGCAATGG TGATAACTTG ACTCCTAAAG 
6AGTTGATGA CTCCGTCGGC GTCCGTTACX: ACTATTGAAC TGAGGATTTC 

TGGTATTGTA CAGTGAAGAT GTAGATATAG AAACCCCAGA CACTCATATT 
ACCATAACAT GTCACTTCTA CATCTATATC TTTGGG6TCT GTGAGTATAA 

TCTTACATGC CCACTATTAA GGAAGGTAAC TCACGAGAAC TAATGGGCCA 
AGAATGTACG GGTGATAATT CCTTCCATTG AGTGCTCTTG ATTACCCGGT 

ACAATCTATG CCCAACAGGC CTA ATTACflT TGCTTT TAGG GACAATTTTA 
TGTTAGATAC GGGTTGTCCG GATTAATGTA ACGAAAATCC CTGTTAAAAT 

TTGGTCTAAT GTATTACAAC AGCACGGGTA ATATGGGTGT TCTGGCGGGC 
AACCAGATTA CATAATGTTG TCGTGCCCAT TATACCCACA AGACCGCCCG 

CAAGCATCGC AGTTGAATGC TGTTGTAGAT TTGCAAGACA SAAACACAGA 
GTTCGTAGCG TCAACTTACG ACAACATCTA AACGTTCTGT CTTTGTGTCT 

GCTTTCATAC CAGCTTTTGC TTGATTC-IZAT TGGTGATAGA ACCAGGTACT 
CGAAAGTATG GTCGAAAACG AACTAAGGTA ACCACTATCT TGGTCCATGA 

TTTCTATGTG GAATCAGGCT GTTGACAGCT ATGATCCAGA TGTTAGAATT 
AAAGATACAC CTTAGTCCGA CAACTGrCGA TACTAGGTCT ACAATCTTAA 

ATTGAAAATC ATGGAACTGA AGATGAACTT CCAAATTACT GCTTTCCACT 
TAACTTTTAG TACCTTGACT TCTACTTSAA GGTTTAATGA CGAAAGGTGA 

GGGAGGTGTG ATTAATACAG AGACTCTTAC CAAGGTAAAA CCTAAJACAG 
CCCTCCACAC TAATTATGTC TCTGAGAATG GTTCCATTTT GGATTTTGTC 

GTCAGGAAAA TGGATGGGAA AAAGATGCH-A CAGAATTTTC AGATAAAAAT 
CAGTCCTTTT ACCTACCCTT TTTCTACGAT GTCTTAAAAG TCTATTTTTA 

GAAATAAGAG TTGGAAATAA TTTTGCCATG GAAATCAATC TAAATGCCAA 
CTTTATTCTC AACXZTTTATT AAAACGGTAC CTTTAGTTAG ATTTACGGTT 
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18501 
16951 

laooi 

19051 
19101 
19151 
19201 
19251 
19301 
19351 
19401 
19451 
19501 
19551 
19601 
19651 
19701 
19751 
19801 



CCTGTOGAGA AJ^TCCTGT ACTCCAACAT AGCGCTGTAT TTCCCC^pA 
GGACACCTCT TTAAAGGACA TGA6GTTGTA TCGCGACATA AACGGGCTGT 



AGCTAAAGTA CAGTCCTTCC AACGTAAAAA TTTCTGATAA COCAAACACC 
TCGATTTCAT GTCAQGAAG6 TTGCATTTTT AAAGACTATT GOGTTTGTGG 

TACGACTACA TGAACAAGCG AGTGQTGGCT CCCGOGCTAQ TQGACTGCTA 
ATGCTGATCT ACTTGTTCGC TCACCACCGA GGGCCC6ATC ACC7GACGAT 

CATTAACCTT GGA6CACGCT GGTCCCTTGA CTATATGGAC AACGTCAACC 
6TAATT6GAA CCTCGT6CGA CCAGG6AACT GATATACCTG TTGCAGTTGG 

CATTTAACCA CCACCX3CAAT GCTGOCCTGC 6CTACCGCTC AATGTTGCTG 
GTAAATTGGT GGTGGCCTTA CGACCG6ACG CGATGGCGA6 TTACAACGAC 

GGCAATGGTC GCTATGTGCC CTTCCACATC CAGCTCCCTC AGAAGTTCTT 
CC6TTACCA6 CGATACACGG GAAGGTGTA6 GTCCACGGA6 TCTTCAAGAA 

TGCCATTAAA AACCTCCTTC TCCTGCCG06 CTCATACACC TACGAGT6GA 
ACGGTAATTT TTG6AGGAA6 AGGACGGCOC GAGTAT6TGG ATGCTCACCT 

ACTTCAGGAA GGATGTTAAC ATGGTTCTGC AGAGCTCCCT AGGAAATGAC 
TGAAGTCCTT CCTACAATTG TACCAAGACG TCTCGAGGGA TCCTTTACTG 

CTAAGGGTTG ACGGAGCCAG CATTAAGTTT GATAGCATTT GCCTTTACGC 
GATTCCCAAC TGCCTCGGTC GTAATTCAAA CTATCGTAAA CGGAAATGCG 

CACCTTCTTC CCCATGGCCC ACAACACCGC CTCCACGCTT GAGGCCATGC 
GTGGAAGAAG GGGTACCGGG TGTTGTGGCG GAGGTGCGAA CTCCGGTACG 

TTAGAAACGA CACCAAC6AC CAGTCCTTTA AC6ACTATCT CTCCGCCGCC 
AATCTTTGCT GTG3TTGCTG CTCAGGAAAT TGCTGATAGA 3AGGCGGCGG 

AACATGCTCT ACCCTATACC CCCCAACGCT ACCAACGTGC CCATATCCAT 
TT6TACGAGA TGGGATATGG GCGGTTGCGA TGGTTGCACG GGTATAGGTA 

CCCCTCCC6C AACTGGGCGG CTTTCCGCGG CTGGGCCrrC ACGCGCCTTA 
GGGGAGGGCG TTGACCCGCC GAAAGGCGCC GACCC66AA6 TGCGCG6AAT 

AGACTAAG6A AACCCCATCA CTGGGCTCGG GCTACGACCC TTATTACACC 
TCTGATTCCT TTGGGGTAGT GACCCGA3CC CGATGCTGGG AATAATGTGG 

TACTCTGGCT CTATACCCTA CCTAGATGGA ACCTTTTACC TCAACCACAC 
AT6AGACCGA GATATGGGAT GGATCTACCT TGGAAAATGG AGTTGGTGTG 

CTTTAAGAAG GTGGCCATTA CCTTTGACTC TTCTGTCAGC TGGCCTGGCA 
GAAATTCTTC CACCGGTAAT GGAAACTGAG AAGACAGTCG ACC6GACCG7 

ATGACCGCCT GCTTACCCCC AACGAGTTTG AAATTAAGCG CTCAGTTGAC 
TACTGGCGGA CGAATGGGGG TTGCTCAAAC TTTAATTCGC GAGTCAACTG 

GGGGAGGGTT ACAACGTTGC CCAGT6TAAC ATGACCAAAG ACTGGTTCCT 
CCCCTCCCAA TCTTGCAACG GGTCACATTG TACTGGTTTC TGACCAAGGA 

GGTACAAATG CTAGCTAACT ATAACATTGG CTACCAGGGC TTCTATATCC 
CCATGTTTAC GATCGATTGA TATTGTAACC GATGGTCCCG AAGATATAGG 
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19851 CAGAGAGCTA CJ^ptSACCGC 'ATGTACTCCT TCTTTAGAAA CTTCCA^^pt 
GTCTCTCGAT GTTCCTQGCG TACATGAGGA AOAAATCTTT GAAGGTCQGG 

19901 ATGAGCCGTC AGGT6GTGGA TGATACTAAA TACAA6GACT ACCAACAGQT 
TACTCGGCAG TCCACCACCT ACTATGATTT ATGTTCCTGA TGGTTGTCGA 

19951 GGGCATCCTA CACCAACACA ACAACTCTGG ATTTGTTGGC TACCTTGCCC 
CCCGTAGGAT GTGGTT6TGT TGTTGAGACC 7AAACAACCG ATGGAACGGG 

20001 CCACCATGCG CGAAGGACA6 GCCTACCCTG CTAACTTCCC CTATCCGCTT 
GGTGGTACGC GCTTCCTGTC COGATGGGAC GATTGAA6GG GATAGGCGAA 

20051 AtAGGCAAGA CCGCAGTTGA CAGCATTACC CAGAAAAAGT TTCOTrGCGA 
TATCCGTTCT GGCGTCAACT 6TCGTAATG6 GTCTTTTTCA AAGAAACGCT 

201 01 TCGCACCCTT TGGCGCATCC CATTCTCCAG TAACTTTATG TCCATGGGCG 
AGCGTGGGAA ACXIGCGTAGG GTAAGAGGTC ATTGAAATAC AGGTACCCGC 

20151 CACTCACAGA CCTGGGCCAA AACCTTCTCT ACGCCAACTC CGCCCACGCG 
GTGAGTGTCT 6GACCCGGTT TTGGAAGAGA TGCGGTTGAG GCGGGTGC6C 

20201 CTAGACATGA CTTTTGAGGT GGATCCCATG GACGAGCCCA CCCTTCTTTA 
GATCTGTACT GAAAACTCCA CCTAG6GTAC CTGCTCGGGT GGGAAGAAAT 

20251 TGTTTTGTTT GAAGTCTTTG ACGTGGTCCG TGTGCACCAG CCQCACCGCG 
ACAAAACAAA CTTCAGAAAC TGCACCAGGC ACACGTGGTC GGCGTGGCGC 

20301 GCGTCATCGA AACCGTGTAC CTGCGCACGC CCTTCTCGGC CGGCAACGCC 
CGCAGTAGCT TTCGCACATG GACGCGTGCG GGAAGAGCCG GCCGTTGCGG 

__20351 ACAACATAAA GAAGCAAGC A ACATCAAC AA CAGCTGCCGC CATGGGCTCC 

TGTTGTATTT CTTCGTTCGT TGTAGTTGW GTCGACGGCG GTACCCGAGG 

20401 AGTGAGCAGG AACTGAAAGC CATTGTCAAA GATCTTGGTT GTGGGCCATA 
I TCACTCGTCC TTGACTTTCG GTAACAGTTT CTAGAACCAA CACCCGGTAT 

I 

20451 TTTTTTGGGC ACCTATGACA AGCGCTTTCC AGGCTTTGTT TCTCCACACA 
AAAAAACCCG TGGATACTGT TCGCGAAAGG TCCGAAACAA AGAGGTGTGT 

20501 AGCTCGCCTG CGCCATAGTC AATACGGCCG GTCGCGAGAC TGGGGGCGTA 
TCGAGCGaAC GCGGTATCAG TTATGCCGGC CAGCGCTCTG ACCCCCGCAT 

20551 CACTGGATGG CCTTTGCCTG GAACCCGCAC TCAAAAACAT GCTACCTCTT 
GIX3ACCTACC GGAAACGGAC CTTGGGCGTG A6TTTTTGTA CGATGGAGAA 

20601 TGAGCCCTTT GGCTTTTCTG ACCAGCGACT CAAGCAGGTr TACCAGTTrG 
ACTCGGGAAA CCGAAAAGAC TGGTCGCTGA GTTCGTCCAA ATGGTCAAAC 

20651 A6TACGAGTC ACTCCTGCGC CGTAGCGCCA TTGCTTCTTC CCCCGACCGC 
TCATGCTCAG TG3U3GACGCG GCATCGCGGT AACGAAGAAG 6GGGCTGGC6 

20701 TGTATAACGC TOGAAAAGTC CACCCAAAGC G7ACAGGGGC CCAACTCGGC 
AGATATT6CG ACCTTTTCA6 6TGGGTTTCG CAT6TCCCC6 6GTTGAGCCG 

20751 CGCCTGTOGA CTATTCTGCT GCATGTTTCT CCACGCCTTT GCCAACTGGC 
GCGGACACCT GATAAGACGA CGTACAAAGA GGTGCGGAAA CGSTTGACCG 



aisf. 
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20801 CCCAAACTCC d|^GATCAC AACCCCACCA TGAACCTTAT TACCQG^K" 
•GGGTTTGAGG'GTACCTAGTG TTGGGGTGGT ACTTGGAATA ATGGCCCCAT 

20851 CCCAACTCCA TGCTCAACAG TCCCCAGGTA CAGCCCACCC TGCGTCGCAA 
GGGTTGAGGT ACGAGTTGTC AGGGGTCCAT 6TCG6GTGGG ACGCA6CGTT 

20901 CCAOGAACAG CTCTACAGCT TCCTGGA6CG CCACTCGCCC TACTTCCGCA 
GGTCCTTGTC GAGATGTCGA AGGACCTCQC GGTGAGCGGG ATGAAGGCGT 

20951 GCCACAGTGC GCAGATTAGG AGC6CCACTT CTTTTTGTCA CTTGAAAAAC 
CXXSTGTCACG CGTCTAATCC TCGCGGTGAA GAAAAACAGT SAACTTTTTG 

21001 ATGTAAAAAT AATGTACTAG AGACACTTTC AATAAAGGCA AATGCTTTTA 
TACATTTTTA TTACArTGATC TCTGTGAAAG TTATTTCCGT TTACGAAAAT 

21051 nrCTACACT CTC6GGTGAT TATTTACCCXr CACCCTTGCC GTCTQC6CCG 
AAACATGTGA GAGCCCACTA ATAAATGGG6 GTGGGAACGG CAGACGCGGC 

21101 TTTAAAAATC AAAGGGGTTC TGCCGCGCAT CGCTATGCGC CACTOGCAGG 
AAATTTTTAG TTTCCCCAAG AC6GCGC6TA GCGATACGCG GTGACCGTCC 

21151 GACAC6TTGC GATACT«3TG TTTAGTGCTC CACTTAAACT CAGGCACAAC 
CT6TGCAAC6 CTATGACCAC AAATCACGAG GTGAATTTGA GTCCGTGTTG 

21201 CATCCGCGGC AGCTCGGTGA AGTTTTCACT CCACAGGCTG CGCACCATCA 
GTAGGCGCCG TCGAGCCACT TCAAAAGTGA GGTGTCCGAC GCGTGGTAOT 

21251 CCAACGCGTT TA6CAGGTCG GGCQCCGATA TCTTGAAGTC GCAGTTGGGG 
GGTTGCGCAA ATCGTCCAGC CCGCGGCTAT AGAACTTCAG CGTCAACCCC 

21301 CCTCCGCCCT 6C6CQCGCGA OTTGCGATAC ACAGGGTTGC AGCACTGGAA 
GGAGGCGGGA CGC3CGCGCT CAACGCTATG TGTCCCAACG TCGTGACCTT 

21351 CACTATCAGC GCCGGGTCGT GCACGCTGGC CAGCACGCTC TTGTCGGAGA 
GTGATAGTCG C66CCCACCA CGTGCGACCG GTCGTGCGA6 AACAGCCTCT 

21401 TCAGATCCGC GTCCA6GTCC TCCGCGTTGC TCAGGGCGAA CGGAGTCAAC 
AGTCTAGGCG CAGGTCCAGG A6GCGCAACG AGTCCCGCTT GCCTCAGTTG 

21451 TTTGGTACCT CCCTTCCCAA AAACGGCXSCG TGCCCAGGCT TTGAGTTGCA 
AAACCATCGA CGGAAG6GTT TTTCCCQCGC ACGGGTCCGA AACTCAACGT 

21501 CTCQCACCGT AGTGGCATCA AAAGGTGACC GTGCCCGGTC TGGGCGTTAG 
GAGCGTGGCA TCACCGTAGT TTTCCACTGG CACGGGCCAG ACCCGCAATC 

21551 GATACAGCGC CTGCATAAAA GCCTTGATCT GCTTAAAAGC CACCTGAGCC 
CTATGTCGCG GACGTATTTT CGGAACIAGA CGAATTTTCG GTGQACTCGG 

21601 TTTGCGCCTT CAGAGAAQAA CATGCCGCAA GACTTGCCGG AAAACTGATT 
AAACGdGGAA GTCTCTTCTT CTACGGCGTT CTGAACGGCC TTT3X3ACTAA 

21651 GGCCGGACAG GCCGGGTCGT GCAC6CA0CA CCTTGCGTCG GTGTTGGAGA 
CCGGCCTGTC CSGCGCAGCA CGTOCGTCGT GGAACGCAGC CACAACCTCT 

21701 TCTGCACCAC ATTTCGGCCC CACCGGTTCT TCACGATCTT 6GCCTTGCTA 
A6ACGTQ6TG TAAAGCCGGG GTGGCCAAGA AGTGCTAGAA CCGGAACGAT 
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21751 GACTGCTCCT TMCGCGCG CTGCCOCTTT TCGCTCGTCA CAUtJCA'^: 
CTGACGAGGA AGTCGCGCGC GACGGGCAAA AGCGAGCAGT OTAGGTAAAG 

21801 AATCACGTGC TCCTTATTTA TCATAATGCT TCCX3TGTAGA CACTTAAGCT 
TTAGTGCACG AQGAATAAAT AGTATTACGA AGGCACATCT GTGAATTC6A 

21851 CGCCTTC6AT CICAGCQCAG CQQTGCAGCC ACAACGC6CA GCCCGTGGGC 
GCGGAA6CTA GAGTCGCGTC GCX^iCGTCGG TGTTGCGCGT CGGGCACCCG 

21901 TCGTGATGCT TGTAGGTCAC CTCTGCAAAC 6ACTGCAGGT ACGCCTGCAG 
AGCACTACGA ACATCCAGTG GAGACGT7TG CTGACGTCCA rTGCGGACGTC 

21951 GAATCGCCCC ATCATCGTCA CAAAGGTCTT GTTGCTGGTG AAGGTCAGCT 
CTTAGCG6GG TAG^AGCAGT GTTTCCA6AA CAACGACCAC TTCCAGTCGA 

22001 6CAACCCCCC CTGCTCCTCC TTCAGCCA6G TCTTGCATAC GGCCGCCAGA 
CGTTGGGCGC CACGAGGAGC AAGTC6GTCC AGAACGTATG CCGGCGGTCT 

22051 GCTTCCACTT GGTCAGGCA6 TAGTTTGAAG TTCGCCTTTA GATCGTTATC 
CGAAGGTGAA CCAGTCCGTC ATCAAACTTC AAGCGGAAAT CTAGCAATAG 

22101 CACGTGGTAC TTGTCCATCA GCGCGCGCGC AGCCTCCATG CCCTTCTCCC 
6TGCACCATG AACAQGTAGT CGCGCGCGCG TCGGAGGTAC GGGAAGAGGG 

22151 ACGCAGACAC GATCGGCACA CTCAGCGGGT TCATCACCGT AATTTCACTT 
TGCGTCTGTG CTAGCCGTGT GAGTCGCCCA AGTAGTGGCA TTAAAGTGAA 

22201 TCCGCTTCGC TGGGCTCTTC CTCTTCCTCT TGCGTCCGCA TACCACGCGC 
AGGCGAAGCG ACCCGAGAAG GAGAAGGAGA ACGCAGGCGT ATGGTGCGCG 

22251 CACTGGGTCG TCTTCATTCA GCCGCCGCAC TGTGCGCTTA CCTCCTraGC 
GTGACCCAGC AGAAGTAAGT CGGCGGCGTG ACACGCGAAT GGAGGAAACG 

22301 CATGCTTGAT TAGCACCGGT GGGTTGCTGA AACCCACCAT TTGTAGCGCC 
GTACGAACTA ATCGTGGCCA CCCAACGACT TTGGGTGGTA AACATCGCGG 

22351 ACATCTTCTC TTTCTTCCTC GCTGTCCACG ATTACCTCTG GTGATGGCGG 
TGTAGAAGAG AAAGAAGGAG CGACAGGTGC TAATGGAGAC CACTACCGCC 

22401 GCGCTCGGGC TTGGGAGAAG GGCGCTTCTT TTTCTTCTTG GGCGCAATGG 
CGCGAGCCCG AACCCTCTTC CCGCGAAGAA AAAGAAGAAC CCGCGTTACC 

22451 CC AAATCCGC CGCCGAGGTC GATGGCCGCG GGCTGGGTGT GCGCGGCACC 
GGTTTAGGCG GCGGCTCCAG CTACCGGCGC CCGACCCACA CGC6CCGTGG 

22501 AGCGCGTCTT GTGATGAGTC TTCCTCGTCC TCG6ACTCGA TACGCCGCCT 
- TCGCGCAGAA CACTACTCAG AAGGAGCAGG AGCCTGAGCT ATGCGGCGGA 

22551 CATCCGCTTT TTTGGGGGCG CCCGGGGAGG CQGCGGCGAC QGGGACGGGG 
GTAGGCGAAA AAACCCCCGC GGGCCCCTCC GCTGCCGCTG CCCCTGCCCC 

22601 ACGACACGTC CTCCATGGTT GGGGGACGTC GCXK:CGCACC GCGTCCGCGC 
T6CTGTGCAG GAGSTACCAA CCCCCTGCA6 CGCGGCGTGG CGCAGGCGC6 

22651 TCGGGGGTGG TTTCGCGCTG CTCCTCTTCC C6ACT0GCCA TTTCCTTCTC 
AGCCCCCACC AAAGC6CGAC GAGGAGAAGG GCTGACCGGT AAAGGAAGAG 
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22701 
22751 
22801 
22851 
22901 
22951 
23001 
23051 
23101 
23151 
23201 
23251 
23301 
23351 
23401 
23451 
23501 
23551 
23601 



GATATCCGTC TTTTTCTAQT ACCTCAOTCA GCTCTTCTTC CTGTCGGATT 

CCGCCCCCTC TGACTTCGCC ACCACCGCCT CCACCGATGC C0CCAAC6CG 
GGCGGG6GAG ACTCAAGCGG TGGTGGCGGA GGTGGCTACG GC6G7TGCGC 

CCTACCACCT TCCCCGTCGA GGCACCCCCG CTT6AGGAGG A6GAAGTGAT 
GGATGGTGGA AG6GGCAGCT CCGTGGGGGC GAACTCCTCC TCCTTCACTA 

TATCGAGCAG 6ACCCAGGTT TTGTAAGCGA AGACGACGA6 GACCGCTCAG 
ATAGCTC6TC CTCGGTCCAA AACATTCCCT TCT6CTGCTC CTGOCGAGTC 

TACCAACAGA GGATAAAAAG CAAGACCAGG ACAACGCAGA GGCAAACGAG 
ATGGTTGTCT CCTATTTTTC GTTCTGGTCC TGTTGCGTCT CCGTTTGCTC 

GAACAAGTCG GGCGGGGGGA CGAAAG6CAT GGCGACTACC TAGATGTGGG 
CTTCTTCAGC CCGCCCCCCT GCTTTOCGTA CCGCTGATCC ATCTACACCC 

AGACGACGTG CTGTTGAAGC ATCTGCAGC6 CCAGTGCGCC ATTATCTGCG 
TCTGCTGCAC 6ACAACTTCG TAGACGTCGC GGTCACGCGG 7AATAGACGC 

ACGCGTTGCA AGAGCGCAGC 6ATGTGCCCC TCGCCATAGC GGATGTCAGC 
TGCGCAACGT TCTCGCGTC6 CTACAOGGGG AGCGGTATCG CCTACAGTCG 

CTTGCCTACG AACGCCACCT ATTCTCACCC CGCGTACCCC CCAAACGCCA 
GAACGGATGC TTGCGGT6GA TAAGAGTOGC GC6CATGGGG OGTTTGCGGT 

AGAAAACGGC ACATGC6AGC CCAACOCGCG CCTCAACTTC TACCCCGTAT 
TCTTTTGCCG TGTACGCTCG GGTTGG6CGC GGAGTTGAAG ATGGGGCATA 

TTGCCGTGCC AGAGGTGCTT GCCACCTATC ACATCTTTTT CCAAAACTGC 
AACGGCACGG TCTCCACGAA CGGTGGATAG TGTAGAAAAA GGTTTTGACG 

AA6ATACCCC TATCCTGCC6 TGCCAACCGC AGCCGAGCGG ACAAGCAGCT 
TTCTATGGGG ATAGGACGGC ACGGTTGGC6 TCGGCTCGCC TGTTCGTCGA 

GGCCTTGCGG CMGGCGCTG TCATACCTGA TATCGCCTC6 CTCAACGAAG 
CCGGAACGCC GTCCCGCGAC AGTATGGACT ATAGCGGAGC GAGTTGCTTC 

TGCCAAAAAT CTTTGAGGGT CTTGGACGCG ACGAGAAGCG CGCGGCAAAC 
ACGGTTTTTA 6AAACTCCCA GAACCTGCGC TGCTCTTCGC GCGCCGTTTG 

GCTCTGCAAC AGGAAAACAG CGAAAATGAA AGTCACTCTG GAGTGrTGGT 
CGAGACGTTG TCCTTTTGTC GCTTTTACTT TCAGTGAGAC CTCACAACCA 

GGAACTCGAG GGTGACAACG CGCGCCTA6C CGTACTAAAA CGCAGCATCG 
CCTT6AGCTC CCACTGTTGC GCGCGGATCG GCATGATTTT GCGTCGTAGC 

AGGTCACCCA CTTTGCCTAC CCGGCACTTA ACCTACCCCC CAAGCTCATG 
TCCAGTGGGT GAAACGGATG GOCCGTGAAT TGGATGGGGG GTTCCAGTAC 

AGCACAGTCA TGAGTGAGCT GATCGTGCGC CGTGCGCAGC CCCTGGAGAG 
TCGTGTCAGT ACTCACTCGA CTAGCACGCG GCACGCGTCG GGGACCTCTC 

GGATGCAAAT TTGCAAGAAC AAACAGAGGA GGGCCTACCC GCAGTTGGCG 
CCTACGTTTA AACGTTCTTG TTTGTCTCCT CCCGGATGGG CGTCAACCGC 
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23651 ACGAGCA6CT AI^CGCTGG CTTCAAACGC GCGAGCCTGC CGACTT^pC 
TGCTCGTCGA TCGCGCGACC GAAGTTT<3CG CGCTCGGACG GCTGAACCTC 

23*701 6AGCGAC6CA AACTAATGAT GGCCGCAQTG CTCGTTACCG TG6AGCTTGA 
CTCGCTGCGT TTGATTACTA CCGGCGTCAC GAGCAATGGC ACCTCGAACT 

23*751 GTGCATGCAG CGGTTCTTTG CTGACCC6GA GATGCAGCGC AAGCTAGAOG 
CACGTACGTC GCCAAGAAAC GACTGGQCCT CTACGTCGCG TTCGATCTCC 

23801 AAACATTGCA CTACACCTTT CGACAGGGCT ACGTACGCCA GGCCTGCAAG 
TTTGTAACGT 6ATGTGGAAA GCTGTCCX:GA TGCATGCGGT CCGQACGTTC 

23851 atctccaacg tggagctctg caacctogtc tcctaccttg gaattttgca 
tagaggttgc acctcgagac gttggacx:ag aggatggaac cttaaaacgt 

23901 cgaaaaccgc cttgggcaaa acgtgcttca ttccacgctc aagggcgagg 
gcttttggcg gaacccgttt tgcacgaagt aaggtgcgag ttcccgctcc 

23951 CGCGCCGCGA CTACGTCCGC GACTGC5TTT ACTTATTTCT ATGCTACACC 
GCGCGGCGCT GATGCAGGC6 CTGACGCAAA TGAATAAAGA TACGATGTGG 

24001 TGGCAGACGG CCATGSGCGT TTGGCAGCAG TGCTTGGAGG AGTGCAACCT 
ACCGTCTGCC GGTACCC6CA AACCGTCGTC ACGAACCTCC TCACGTTGGA 

24051 CAAGGAGCTG CAGAAACTGC TAAAGCAAAA CTTGAAGGAC CTATCGACGG 
GTTCCTCGAC 6TCTTTGACG ATTTCGTTTT GAACTTCCTG GATACCTGCC 

24101 CCTTCAACGA GCGCTCCGTG GCCGCGCACC TGGCGGACAT CATTTTCCCC 
GGAAGTTGCT CGCX3AGGCAC CGGCGCGTGG ACCGCCTGTA GTAAAAGGGG 

24151 GAACGCCTGC TTAAAACCCT GCAACAGGGT CTGCCAGACT TCACCAGTCA 
— - CTTGCGGACG AATTTfGGGA CGTOGtCCCA GACGGTCTGA^GTGGTCAGT 

24201 AAGCATGTTG CAGAACTTTA GGAACTTTAT CCTAGAGCGC TCAGGAATCT 
TTCGTACAAC GTCTTGAAAT CCTTGAAATA GGATCTCGCG AGTCCTTAGA 

24251 TGCCCGCCAC CTGCTGTGCA CTTCCTAGCG ACTTTGTGCC CATTAAGTAC 
ACGGGCGGTG GACXyiCACGT GAAGGATCGC TGAAACACGG GTAATTCATG 

24301 CGCGAATGCC CTCCGCCGCT TTGGGGCCAC TGCTACCTTC TGCAGCTAGC 
GCGCTTAC6G GAGGCGGCGA AACCCCGGTG ACGATGGAAG ACGTCGATCG 

24351 CAACTACCTT GCCTACCACT CTGACATAAT GGAAGACGTG AGCGGTGACG 
GTTGATGGAA CGGATGGT6A GACTGTATTA CCTTCTGCAC TCGCCACTGC 

24401 GTCTACTGGA GTGTCACTGT CGCTGCAACC TATGCACCCC GCACCGCTCC 
CAGATGACCT CACAGTGACA GCGACGTTGG ATACGTGGGG CGTGGCGAGG 

24451 CTGGTTTGCA ATTCGCAGCT GCT1!AACGAA AGTCAAATTA TCGGTACCTT 
GACCAAACGT TAAGC6TCGA CGAATTGCTT TCA6TTTAAT AGCCAT6GAA 

24501 TGAGCTGCAG GGTCCCTCGC CTGACGA?«A GTCCGCGGCT DCGGGGTTGA 
ACTCGAC6TC CCAG6GAGCG GACTGCTTXT CAGGCGCCXSA GGCCCCAACT 

24551 AACTCACTCC G6GGCTGTG6 ACGTCGGCTT ACCTTCGCAA ATTTGTACCT 
TTGACTGAGG CCCCGACACXT TGCAGCC6AA TGGAAGCGTT TAAACATGGA 




<P7 2 



121/144 



wo 02/22080 



PCT/USOl/28861 



24601 
24651 
24701 
24751 
24801 
24851 
24901 
24951 
25001 
25051 
25101 
25151 
25201 
25251 
25301 
25351 
25401 
25451 
25501 



GAGGACTACC AW^CCACGA GATTAC3GTTC TACGAAGACC AATCCCfll^C 
CTCCTGATGG TGCGGGTGCT CTAATCCAA6 ATGCTTCTGG TTAGGGCGGG 

GCCTAATGCG GAGCTTACCG CCTGCGTCAT TACCCAGGGC CACATTCTTG 
CGGATTACGC CTCGAATGGC GGACGCAGTA ATG6G7CCCG GTG7AAGAAC 

GCCAATTGCA AGCCATCAAC AAAGCCCGCC AAGAGTTTCT GCTACGAAAG 
CGGTTAACGT TCGGTAGTTG TTTCGGGCGG TTCTCAAAGA CGATGCTTTC 

GGACGGGGGG TTTACTTGGA CCCCCAGTCC GGCGAGGAGC TCAACCCAAT 
CCTGCCCCCC AAATGAACCT GGGGGTCAGG CCGCTCCTCG AGTTGGGTTA 

CCCCCCGCCG CCGCAGCCCT ATCAGCAGCA GCCGCGGGCC CTTGCTTCCC 
GGGGGGC6GC GGCGTCGGGA TAGTC6TC6T CGGC6CCC6G GAACGAAGGG 

AGGATGGCAC CCAAAAAGAA GCTGCAGCTG CCGCCGCCAC CC^CGGACGA 
TCCTACCGTG GGTTTTTCTT CGACGTCGAC GGCGGCGGTG G6TGCCTGCT 

GGAGGAATAC TGGGACAGTC AGGCAGAGGA GGTTTTGGAC GAGGAGGAGG 
CCTCCTTATG ACCCTGTCAG TCCGTCTCCT CCAAAACCTG CTCCTCCTCC 

AGGACATGAT GGAAGACTGG GAGAGCCTAG ACGAGGAAGC TTCCGAGGTC 
TCCTGTACTA CCTTCTGACC CTCTCGGATC TGCTCCTTCG AAGGCTCCAG 

GAAGAGGTGT CAGACGAAAC ACCGTCACCC TCGGTCGCAT ?CCCCTCGCC 
CTTCTCCACA GTCTGCTTTG TGGCAGTGGG AGCCAGCGTA AG3GGAGCGG 

GGCGCCCCAG AAATCGGCAA CCGGTTCCAG CATGGCTACA ACCTCCGCTC 
CCGCGGGGTC TTTAGCCGTT GGCCAAGGTC GTACCGATGT TG-GAGGCGAG 

CTCAGGCGCC GCCGGCACTG CCCGTTCGCC 6ACCCAACCG TA3ATGGGAC 
GAGTCCGCGG CGGCCGT6AC GGGCAAGCGG CTGGGTTGGC ATCTACCCTG 

ACCACTGGAA CCAGGGCC66 TAAGTCCAAG CAGCCGCCGC CGTTAGCCCA 
TGGTGACCTP GGTCCCGGCC ATTCAGGTTC GTCGGCG6CG GCAATCGGGT 

AGAGCAACAA CAGC6CCAAG GCTACCGCTC ATGGCGCGGG C^IAAGAACG 
TCTCGTTGTT GTCGCGGTTC CGATGGCGAG TACCGCGCCC GTSTTCTTGC 

CCATAGTTGC TTGCTTGCAA GACTGTGGGG GCAACATCTC CTTCGCCCGC 
GGTATCAACG AACGAACGTT CTGACACCCC CGTTGTA6AG GAAGCG6GCG 

CGCTTTCTTC TCTACCATCA CGGCGTGGCC TTCCCCCGTA ACATCCTGCA 
6CGAAAGAAG AGATGGTAGT GCCGCACC66 AAGGGGGCAT TGTAGGACGT 

TTACTACCGT CATCTCTACA GCCCATACTG CACC6GCGGC AGCG6CAGCA 
AATGAT6GCA 6TAQA6ATGT CGGGTATQAC 6TGGCCGCCG TCGCCGTCGT 

ACAGCAGCGG CCACACAGAA 6CAAAGGCGA CC6GATAGCA AGACTCTGAC 
TGTCGTCGCC GGTGTGTCTT CGTTTCC6CT GGCCTATCGT TCTGAGACTG 

AAAOCCCAAG AAATCCACAG CGGCGGCAGC AGCA6GA6GA GGAGCGCT6C 
TTTCGGGTTC TTTAGGTGTC GCCGCCGTCO TCGTCCTCCT CCTCQCGACG 

GTCTGGCGCC CAACGAACCC GTATCGACCC GCGAOCTTAG AAACAGGATT 
CAGACCCCGG CTTOCTTGCG CATAGCTGGG CGCTCGAATC TTTGTCCTAA 
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25551 TTTCXrCACTC tIB^gctat atttcaacag aocaggggcc aagaacJIBI^ 

AAAGGGTGAG ACATACGATA rAAAGTTGTC TCGTCCCCGG TTCTTGTTCT 

25601 GCTGAAAATA AAAAACAGGT CTCTGCGATC CCTCACCCGC AGCTGCCTGT 
CGACTTTTAT TTTTTGTCCA GAGACGCTAG G6AGTGGGCG TC6AC6GACA 

25651 ATCACAAAA6 CGa^GATCAG CTTCGGC6CA COCTGGAftGA CGCGGAG6CT 
TA6TGTTTTC GCTTCTAGTC GAAGCCGCGT GCGACCTTCT GCGCCTCCGA 

25701 CTCTTCACTA AATACTGCGC GCTGACTCTT AAGGACTAGT TTCGCGCCCT 
GAGAAGTCAT TTA7GACGCG CGACTGAGAA TTCCTGATCA AAGCGCGGGA 

25751 TTCTCAAATT TAAGCGCGAA AACTAOGTCA TCTCCAGCGG CCACACCC6G 
AAGAG7TTAA ATTCGCGCrr TTGATGCAGT AGAGGTC6CC GGTGT6GGCC 

25801 CGCCAGCACC TGTTGTCAGC GCCATTATCA GCAAGGAAAT TCCCACGCCC 
6CGGTCGTGG ACAACAGTCG CGGTAftTACT CGTTCCTTTA AGGGTGCGG6 

25851 TACATGTGGA 6TTACCAGCC ACAAATGGGA CTTGCGGCTG GAGCTGCCCA 
ATGTACACCT CAATOGTCGG TGTTTACCCT GAACGCCGAC CTCGACGGGT 

25901 A6ACTACTCA ACCCGAATAA ACTACATGAG CGCGGGACCC CACATGATAT 
TCTGATGAGT TGGGCTTATT TGATGTACTC GCGCCCTGGG GT3TACTATA 

25951 CCCGGGTCAA CGGAATACGC GCCCACCGAA ACC6AATTCT CCTGGAACAG 
GGGCCCAGTT GCCTTATGC6 CGGGTGGCTT TGGCTTAAGA GGACCTTGTC 

26001 6CGGCTATTA CCACCACACC TCGTAATAAC CTTAATCCCC GTAGTTGGCC 
CGCCGATAAT G6T6GTGTGG AGCATTATTG GAATTAGGGG CATCAACCGG 

26.051 CGCTGCCCTG_GTCTACCAGG„AAAGTCCCG^ TCCCACC ACT GT GG TACT TC 

GC6ACGGGAC CACATGGTCC TTTCA6GGCG AGGGTGGT6A CACCATGAAG 

26101 CCAGAGACGC CCAGGCCGAA GTTCAGA'TGA CTAACTCAGG GGCGCAGCTT 
GGTCTCTGC6 GGTCCGGCTT CAAGTCTACT GATTGAGTCC CCX3CGTCGAA 

26151 GCGGGCGGCT TTCGTCACAG GGTGCGGTCG CCCGGGCAGG GTATAACTCA 
CGCCCGCCGA AAGCAGTGTC CCACGCCAGC GGGCCCGTCC CATATTGAGT 

26201 CCTGACAATC AGAGGGCGAG GTATTCAGCT CAACGACGAG TCGGTGAGCT 
GGACTGTTAG TCTCCCGCTC CATAAGTCGA GTTGCTGCTC AGCCACTCGA 

26251 CCTCGCTTGG TCTCCGTCCG 6ACGGGACAT TTCAGATCGG CGGCGCCGGC 
6GAGCGAACC AGAGGCAGGC CTGCCCTGTA AAGTCTAGCC GCCGCGGCCG 

26301 CGCTCTTCAT TCACGCCTCG TCAGGCAATC CTAACTCTGC AGACCTCGTC 
GCGAGAAGTA AGT6CGGAGC A6TCCGTTAG GATTGAGACG TCTGGAGCAG 

26351 CTCTGAGCCG CGCTCTGGAG GCATTGGAAC TCTGCAATTT ATTGAGGAGT 
GAGACTCGGC GC6AGACCTC CGTAACCTTG AGACGTTAAA TAACTCCTCA 

26401 TTGTGCCATC GGTCTACTTT AACCCCTTCT CGGGACCTCC CGGCCACTAT 
AACAC66TAG CCAiGATGAAA TTGGGGAAGA GCCCTGG^kGG 6DCGGTGATA 

26451 CCGGATCAAT TTATTCCTAA CTTTGACGCG GTAAAGGACT CGGCGGACGG 
GGCCTAGTTA AATAAGGATT GAAACTGCGC CATTTCCTGA GCCGCCTGCC 




123/144 



wo 02/22080 



PCT/USOl/28861 



26501 

» 

26551 
26601 
26651 
26701 
26751 
26801 
26B51 
26901 
26951 
27001 
27051 
27101 
27151 
27201 
27251 
27301 
27351 
27401 



•CTACGACTGA ^i^TAAGTG GAGAGGCAGA GCAACTGCGC .CTGAAA^C 
GATGCTGACT TACAATTCAC CTCTCCGTCT CGTTGACGCG GACTTTGTGG 

TGGTCCACTG TCGCCGCCAC AAGTGCTTrQ CCCGCGACTC CGGTCAGTTT 
ACCAGGTGAC AGCGGC6GTG TTCACGAAAC GGGCGCTGAG GCCACTCAAA 

TGCTACTTTG AATTGCCCGA GGATCATATC GAGGGCCCGG CGCACGGCGT 
ACGATGAAAC TTAACGGGCT CCTAGTATAG CTCCCGGGCC GCGTGCCGCA 

CCGGCTTACC GCCCAGGGAG AGCTTGCCCG TAGCCTGATT CQGGAGTTTA 
GGCCGAATG6 CGGGTCCCTC TCGAACGGGC ATCGGACTAA GCXICTCAAAT 

CCCAGCGCCC CCTGCTAGTT GAGCGGGACA 6GGGACCCTG TGTTCTCACT 
GGGTCGCGGG GGACGATCAA CTCGCCCTGT CCCCTGGGAC ACAAGAGTGA 

GTGATTTGCA ACTGTCCTAA CCCTCGATTA CATCAAGATC TTTCTTGCCA 
CACTAAACGT TOACAGGATT GGGACCTAAT 6TAGTTCTAG AAACAACGGT 

TCTCTGTGCT GAGTATAATA AATACAGAAA TTAAAAIATA CTGGGGCTCC 
AGAGACACGA CTCATATTAT TTATGTCTTT AATTTTATAT GftCCCCGAGG 

TATCGCCATC CT6IAAACGC CACCGTCTTC ACCCGCCCAA GCAAACCAAG 
ATAGCGGTAG GACATTTGCG GTGGCAGAAG TGGGCGGGTT CGTTTOGTTC 

GCGAACCTTA CCT3GTACTT TTAACATCTC TCCCTCTGTG ATTTACAACA 
CGCTTGGAAT GGACXATGAA AATTGTAGAG AGGGAGACAC TAAATGTTGT 

GTTTCAACCC AGACG6AGTG AGTCTACGAG AQAACCrcTC CGAGCTCAGC 
CAAAGTTGGG TCT3CCTCAC TCAGATGCTC TCTTGGAGAG GCTCGAGTCG 

TACTCCATCA GAAAAAACAC CACCCTCCTT ACCTGCCGGO AACGTACGAG 
ATGAGGTAGT CTTTTTTGTG GTGGGAGGAA TCGACCGCCC TTGCATGCTC 

TGCGTCACCG GCCGCTCCAC CACACCTACC GCCTGACCGT AAACCAGACT 
ACeCAGTGGC CGGCGACGTG GTGTGGATGG CGGACTGGCA TTTGGTCTGA 

TTTTCCGGAC AGACCTCAAT AACTCTGTTT ACCAGAACAG GAGGTGAGCT 
AAAAGGCCTG TCTGGAGTTA TTGAGACAAA TGGTCTTGTC CTCCACTCGA 

TAGAAAACCC TTAGGGTATT AGGCCAAAX^ COCAGCTACT GO'GGGGTTTA 
ATCTTTTGGG AATCCCATAA TCCGGTTTCC GCGTCGATGA CACCCCAAAT 

TOAACAATTC AAGCAACTCT ACGGGCTATT CTAATTCAGG TTTCTCTAGA 
ACTTGTTAAG TTCGTTGAGA TGCCCGATAA GATTAAG7CC AAAGAGATCT 

ATCGGGGTTG GGGTTATTCT CTGTCTTGTG ATTCTCTTTA TTCTTATACT 
TAGCCCCAAC CCCAATAAGA GACAGAAOAC TAAGAGA^T AAGAATATGA 

AACGCTTCTC TGCCTAAGGC TCGCC6CCTG CTGTGTGCAC ATTTQCATTT 
TTGCGAA6AG ACGGATTCCG AGCGGCGGAC GACACACGT6 TAAACGTAAA 

ATTGTCAGCT TTTTAAACGC TGGGGTCGCC ACCCAAGAT6 ATTAGGTACA* 
TAACAGTCGA AAAATTTOCG ACCCCASCGG TGGGTTGTAC TAATCCATGT 

TAAICCTACC TTTACTCACC CTTGCGTCAG CCCACGGTAC CACCCAAAAG 
ATTAGGATCC AAAT6AGTGG GAACGCAGTC G66TGCCAT6 GTGGGTTTTC 
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27451 GTGGATTTTA- A^PSCCAGC CTQTAATGTT ACATTCGCAG CTGAAG^CV . 
CACCTAAAAT TCCTCGGTCG GACATTACAA TGTAAGCGTC GACTTCGATT 



27501 TGAGTGCACC ACTCTTATAA AATGCACCAC AGAACATGAA AAGCTGCTTA 
ACTCACGTGG TGAGAATATT TTACGTGGTG TCTTGTAC1T TTCGACGAAT 

27551 TTCGCCACAA AAACAAAATT GGCAAOTATG CTGTTTATGC TATTT6GCAG 
AAGCGGTGTT TTTGTTTTAA CCGTTCATAC GACAAATACG ATAAACC6TC 

27601 CCAQGTGACA CTACAGAGTA TAATGTTACA GTTTTCCAGG GTAAAAGTCA 
GGTCCACTGT GATOTCTCAT ATTACAATGT CAAAAGGTCC CATTTTCAGT 

27651 TAAAACTTTT ATGTATACTT TTCCATTTTA TGAAATGTOC GACATTACCA 
ATTTTGAAAA TACATATGAA AAGGTAAAAT ACTTTACACG CTGTAATGGT 

27701 TGTACATGAG CAAACAGTAT AAGTTGTG6C CCCCACAAAA TTCTCTGCAA 
ACATGTACTC GTTT6TCATA TTCAACACCG GGGGTGTTTT AACACACCTT 

27751 AACACTGGCA CTTTCTGCTG CACTGC3ATG CTAATTACAG TGCTCGCTTT 
TTGTGACCGT GAAAGACGAC GTGAC6A7AC GATTAAT6TC ACGAGCGAAA 

27801 GGTCTGTACC CTACTCTATA TTA7ATACAA AAGCAGACGC AGCTTTATTG 
CCAGACATGG GATGAGATAT AATTTATGTT TTCGTCTGCG TCGAAATAAC 

27851 AGGAAAAGAA AATGCCTTAA TTTACTAAGT TACAAAGCTA ATGTCACCAC 
TCCTTTTCTT TTACGGAATT AAATGATTCA ATGTTTCGAT TACAGTGGTG 

27901 TAACTGCTTT ACTCGCTGCT TGCAAAACAA ATTCAAAAAG TTAGCATTAT 
ATTGACGAAA TGAGCGACGA ACGTTTTGTT TAAGTTTTTC AATCGTAATA 

279.51. AATTAGAATAJSGATTTAAAC CCCCCQGTCA. TTTCCTGCTC AATACCATTC 
TTAATCTTAT CCTAAATTTG GGGGGCCAGT AAAGGACGAG TTATGGTAAG 

28001 CCCTGAACAA TTGACTCTAT GTGGGATATG CTCCAGCGCT ACAACCTTGA 
GGGACTTGTT AACTGAGATA CACCCTATAC GAGGTCGCGA TGTTGGAACT 

28051 AGTCAGGCrr CCTGGATGTC AGCATCTGAC TTTG6CCAGC ACCTGTCCCG 
TCAGTCCGAA GGACCTACAG TCGTAGACTG AAACCGGTCG TGGACAGGGC 

28101 CGGATTTGTT CCAGTCCAAC TACAGCGACC CACCCTAACA GA3ATGACCA 
GCCTAAACAA GGTCAGGTTG ATGTCGCT6G GTGGGATTGT CTCTACTGGT 

28151 ACACAACCAA CGCGGCCGCC GCTACCGGAC TTACATCTAC CACAAATACA 
TGTGTTGGTT GCGCCGGCGG CGATGGCCTG AATGTAGATG GTGTTTATGT 

28201 CCCCAAGTTT CTCCCTTTGT CAATAACTGG GATAACTTGG GCATGTGGTG 
GGGGTTCAAA 6ACGGAAACA GTTATTGACC CTATTGAACC CGTACACCAC 

28251 GTTCTCCATA GCGCTTATGT TTGTATGCCT TATTATTATG TGGCTCATCT 
CAAGAGGTAT CGCGAATACA AACATACGGA ATAATAATAC ACCGAGTAGA 

28301 GCTCCCTAAA GCGCAAACGC GCXTCGACCAC CCATCTATAG TCCCATCATT 
CGACGGATTT CGCGTTTGCG CGGGCTGGT6 GGTAGATATC AGGGTAGTAA 

28351 GTGCTACACC CAAACAATGA TGGAATCCAT AGATTGGACG GACTGAAACA 
CACGATGTGG GTTTGTTACT ACCTTAGGTA TCTAACCTGC CTSACTTTGT 
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28401 CATGTTCTTT T^BtTACAG TATGATTAAA TGAGACATGA TTCCTO^Bf 
GTACAAGAAA AotRaTGTC ATACTAATTT ACTCTGTACT AAGGAGCTCA 

2B451 TTTTATATTA CTGACCCTTG TTGCGCTTTT TTGTGCGTGC TCCACATTGG 
AAAATATAAT GACTGGGAAC AAC6CGAAAA AACACGCACG AGGTGTAACC 

2 85 01 CTGCOGTTTC TCACATCGAA GTAGACTGCA TTCCAGCCTT CACAGTCTAT 
GACGCCAAAG AGTGTAQCTT CATCTGACGT AAGGTCGGAA GTGTCAGATA 

28551 TTGCTTTACG GATTTGTCAC CCTCACGCTC ATCTGCAGCC TCATCACTGT 
AACGAAATGC CTAAACAGTG OGAGTGCGAG TAGACGTCGG AGTAGTGACA 

28601 GGTCATCGCC TTTATCCAGT GCATTGACTG G6TCTGTGTG CGCTTTGCAT 
CCAGTAGCGG AAATAGGTCA CGTAACT6AC CCAGACACAC GCGAAACGTA 

28651 ATCTCAGACA CCATCCCCAG TACAGGGACA GGACTATAGC T6AGCTTCTT 
TAGAGTCTGT GGTAGGGGTC ATGTCCCTGT CCTGATATCG ACTCGAAGAA 

28701 AGAATTCTTT AATTATGAAA TTTACTGTGA CTTTTCTGCT GATTATTTGC 
TCTTAA6AAA TTAATACTTT AAATGACACT 6AAAAGAC6A CTAATAAACG 

28751 ACCCTATCTG CGTTTTGTTC CCCGACCTCC AAGCCTCAAA 6ACATATATC 
TGGGATAGAC GCAAAACAAG GGGCTGGAGG TTCGGAGTTT CTGTATATAG 

28801 ATGCAGATTC ACTCGTATAT GGAATATTCC AAGTTGCTAC AATGAAAAAA 
TAC6TCTAAG TX3AGCATATA CCTTATAAGG TTCAACGATG TTACTTTTTT 

28851 GCGATCTTTC CGAAGCCTGG TTATATGCAA TCATCTCTGT TATGGTGTTC 
CGCTAGAAAG GCTTCGGACC AATATACGTT AGTAGAGACA ATACCACAAG 

28901 TGCAGTACCA TCTTAGCCCT AGCTATATAT CCCTACCTTG ACATTGGCTG 
ACGTCATGGT AGAATCGGGA TCGATATATA GGGATGGAAC TGTAACCGAC 

28951 GAACGCAATA GATGCCATGA ACCACCCAAC TTTCCCCGCG CCCGCTATGC 
CTTGCGTTAT CTACG6TACT TGGTGGGTTG AAAGGGGCGC GGGCGATACG 

29001 TTCCACTGCA ACAAGTTGTT GCCGGCGGCT TTGTCCCAGC CAATCAGCCT 
AAGGTGACGT TGTTCAACAA CGGCCGCCGA AACAGGGTCG GTTAGTCGGA 

29051 CGCCCACCTT CTCCCACCCC CACTGAAATC AGCTACTTTA ATCTAACAGG 
GCGGGTGGAA GAG3GTGGCC CTGACTTTAG TCGATGAAAT TAGATTGTCC 

29101 AGGAGATGAC TGACACCX:TA GATCTAGAAA TGGACGGAAT TATTACAGAG 
TCCTCTACTG ACTGTGGGAT CTAGATCTTT ACCTGCCTTA ATAATGTCTC 

29151 CAGCGCCTGC TAGAAAGACG CAGGGCAGCG OCCGAGCAAC AGCGCATGAA 
GTCGCG6ACG ATCTTTCTGC GTCCCGTCGC CG6CTCGTT6 TC6CGTACTT 

29201 TCAAGAGCTC CAAGACATGG TTAACTTGCA CCAGTGCAAA AGGGGTATCT 
AGTTCTCGAG GTTCTGTACC AATTGAACGT GGTCACGrTT TCCCCATAGA 

29251 TTTGTCTCGT AAAGCAGGCC AAAGTCACCT ACGACAGTAA TACCACCGGA 
AAACAGAGCA TTICGTCCGG TTTCAGTGGA TGCT6TCATT ATGGTGGCCT 

29301 CACCGCCTTA GCTACAAGTT GCCAACCAAG CGTCA6AAAT TGGTGGTCAT 
GTCGCGGAAT CGATGTTCAA CGGTTGGTTC GCAGTCTTTA ACCACCAGTA 
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29351 GGT6G6AGAA Ji^^l^CATTA CCATAACTCA 6CACTC6GTA G;^CC:^pb 
CCACCCTCTT TTCGGQTAAT GGTATTGAGT CGTGAGCCAT CTTTGGCTTC 

29401 GCTGCATTCA CTCACCTTGT CAAGGACCTG AGGATCTCTG CACCCTTATT 
C6ACGTAAGT (3A6TGGAACA GTTCCTGGAC TCCTAGA6AC GTGGGAATAA 

29451 AAGACCCTGT GCGGTCTCAA ACATCTTATT CCCTTTAACT AATAAAAAAA 
TTCTOOGACA CGCCAGAOTT TCTAGAATAA OGQAAATTGA TTATTTTTTT 

29501 AATAATAAAG CATCACTTAC TTAAAATCAG TTAGCAAATT TCTGTCCAGT 
TTATTATTTC G7AGTGAAT6 AATTTTA6TC AATCGTTTAA AGACAGGTCA 

29551 TTATTCAGCA GCACCTCCTT GCCCTCCTCC CAGCTCTGGT ATTGCAGCTT 
AATAAGTCGT CGTGGAGGAA CGGGAGGAGG GTCGAGACCA TAACGTCGAA 

29601 CCTCCTGGCT GCAAACTTTC TCCACAATCT AAATGGAATG TCAGTTTCCT 
GGAGGACCGA CGTTTGAAAG AGGTGTTAGA TTTACCTTAC AGTCAAAGGA 

29651 CCTGTTCCTG TCCATCCGCA CCCACTATCT TCATGTTGTT GCAGATGAAG 
GGACAAGGAC AGGTAGGCGT GGGTGA7AGA AGTACAACAA CGTCTACTTC 

29701 CGCGCAAGAC CGTCTGAAGA TACCTTCAAC CCCGTGTATC CATATGACAC 
GCGCGTTCTG GCAGACTTCT ATGGAAGTTG GGGCACATAG GTATACTGTG 

29751 GGAAACCGGT CCTCCAACTG TGCCTTTTCT TACTCCTCCC TTTGTATCCC 
CCTTTGGCCA 0GA6GTTGAC ACGGAAAAGA ATGAGGAGGG AAACATAGGG 

29801 CCAATGGGTT TCAAQAGAGT CCCCCTGGGG TACTCTCTTT 6CGCCTATCC 
GGTTACCCAA AGTTCTCrCA GGGGGACCCC ATGAGAGAAA CGCGGATAGG 

29851 GAACCTCTAG TTACCTCCAA TGGCATGCTT GCGCTCAAAA TGGGCAACGG 
" CTTGGAGATC AATGGAGGTT ACCGTACGAA CGCGAGfTTT ACCCGTTGC'C 

29901 CCTCTCTCTG GACGAGGCCG GCAACCTTAC CTCCCAAAAT GTAACCACTG 
GGAGAGAGAC CTGCTCCGGC CGTTGGAATG .GAGGGTTTTA CATTGGTGAC 

29951 TGAGCCCACC TCTCAAAAAA ACCAAGTCAA ACATAAACCT GGAAATATCT 
ACTCGGGTGG AGAGTTTTTT TGGTTCAGTT TGTATTTGGA CCTTTATAGA 

30001 GCACCCCTCA CAGTTACCTC AGAAGCCCTA ACTGTGGCTG CCGCCGCACC 
CGTGGGGAGT GiTCAATGGAG TCTTCGGGAT T6ACACCGAC GGCGGCGTGG 

30051 TCTAATGGTC GCGGGCAACA CACTCACCAT GCAATCACAG GCCCCGCTAA 
AGATTACCAG CGCCCGTTGT GTGAGTGGTA CGTTAGTGTC CGGGGCGATT 

30101 CCGTGCACGA CTCCAAACTT AGCATTGCCA CCCAAGGACC CCTCACAGTG 
GGCACGTGCT GAGGTTTGAA TCGTAACGGT GGGTTCCOXSG GGAGTGTCAC 

30151 TCA6AA66AA AGCTAGCCCT GCAftACAICA GGCCCCCTCA CXJICCACCGA 
AGTCTTCCTT TCGATC6GGA CGTTTGTAGT CCSGG6GAG7 GGTGGTGGCT 

30201 TAGCAGTACC CTTACTATCA CTGOCTCACC CCCTCTAACT ACTGCCACTG 
ATCGTCAT06 GAATGATA6T GACG6AGTGG GGGAGATTGA TCAC6GTGAC 

30251 GTAGCTTGGG CATT6ACTTG AAA6AGCCCA TTTATACACA AAATGGAAAA 
CATCGAACCC GTAACTGAAC TTTCTCGGGT AAATATGTGT TTTACCTTTT 
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30301 CTAGGACTAA A|^CGGGGC TCCTTTGCAT GTAACAGAfcc'TliCt^AaMIC 
GATCCTGATO tcPSgCCCCG AGGAAACGTA CATTGTCTGC TGGATT^G 

30351 TTTGACCGTA GCAACTGGTC CAGGTGTGAC TATTAATAAT ACTTCCTTGC 
AAACTGGCAT cgttgaccag GTCCACACTG ATAATTATTA TGAAGGAACG 

30401 AAACTAAAGT TAC9GGAGCC TTGGGTTTTG ATTCACAAGG CAATATGCAA 

tttgatttca atgacctcgg aacccaaaac taagtgttcc gttatacgtt 

30451 cttaatgtag caggaggact aaggattgat tctcaaaaca gacgccttat 
gaattacatc gtcctcctga ttcctaacta agagttttgt ctgcggaata 

30501 acttgatgtt agttatccgt ttgatgctca aaaccaacta aatctaagac 

- TGAACTACAA TCAATAGGCA AACTACGA6T TTTGGTTGAT 7TAGATTCTG 

30551 TAG6ACAGGG CCCTCTTTTT ATAAACTCAG CCCACAACTT GGATATTAAC 
ATCCTCTTCCC GGGAGAAAAA TATTT6AGTC GGGTGTTGAA CCTATAATT6 

30601 TACAACAAAG GCCTTTACTT GTTTACAGCT TCAAACAATT CCAAAAAGCT 
ATGTTGTTTC CGGAAATGAA CAAATGTCGA A6TTTGTTAA G6TTTTTCGA 

30651 TGAGGTTAAC CTAAGCACTC CCAAGGGGTT GATGTTTGAC GCTACAGCCA 
ACTCCAATTG GATTCGTGAC GGTTCCCCAA CTACAAACTG C6ATGTCGGT 

30701 TAGCCATTAA TGCAGGAGAT GCGCTTGAAT TTGGTTCACC TAATGCACCA 
ATCGGTAATT ACGTCCTCTA CCCGAACTTA AACCAAGTGG ATTACGTGGT 

30751 AACACAAATC CCCTCAAAAC AAAAATTGGC CATGGCCTAG AATTTGATTC 
TTGTGTTTA6 GGGAGTTTT6 TTTTTAACCG GTACCGGATC TTAAACTAAG 

30B01 AAACAAGGCT ATGGTTCCTA AACTAGGAAC TGGCCTTAGT TTTGACAGCA 
TTTGXTCCGA TACCAAGGAT TTGATCCTTG ACCGGAATCA AAACT6TCGT 

30851 CAGGTGCCAT TACAGTAGGA AACAAAAATA ATGATAAGCT AACTTTGTGG 
GTCCACGGTA ATGTCATCCT TTGTTTTTAT TACTATTCGA TTGAAACACC 

30901 ACCACACCAG CTCCATCTCC TAACTGTAGA CTAAATGCAG AGAAAGATGC 
TGGTGTGGTC GAG6TAGAGG ATTGACATCT GATTTACGTC TCTTTCTACG 

30951 TAAACTCACT TTG6TCTTAA CAAAATGTGG CAGTCAAATA CTTGCTACAG 
ATTTGAGTGA AACCAGAATT GTTrTACACC GTCAGTTTAT GAACGATGTC 

31001 TTTCAGTrrr GGCTGTTAAA GGCAGTTTGG CTCCAATATC TGGAACAGTT 
AAAGTCAAAA CCGACAATTT CCGTCAAACC GAGGTTATAG ACCTTGTCAA 

31051 CAAAGTGCTC ATCTTATTAT AAGATTT6AC GAAAATGGAG TGCTACTAAA 
GTTTCACGA6 TAGAATAATA TTCTAAACTG CTTTTACCTC ACGAT6ATTT 

31101 CAATTCCTTC CTGGACCCAG AATATTGCAA CTTTAGAAAT GGAGATCTTA 
6TTAAGGAA6 GACCIXSGGTC TTATAACCTT GAAATCTTTA CCTCTAGAAT 

31151 CTGAAGGCAC AGCCTATACA AACGCTGTTO GATTTATGCC TAACCTATCA 
6ACTTCCGT6 TCG6ATATGT TTGCGACAAC CTAAATACGG ATTGGATAGT 

31201 GCTTATCCAA* AATCTCACGG TAAAACTGCC AAAAOTAACA TTOTCAGTCA 
CGAATAGGTT TTA6AGTGCC ATTTTGACGG TTTTCATTGT AACAGTCAGT 
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31251 AGTTTACTTA AJUCGAGACA AAACTAAACC TGTAACACTA ACCATtSP^C 
TCAAATGAAT TTGCCTCTGT TTTGATTTGG ACATTGTGAT TGGTAATGTG 

313 01 TAAACGGTAC ACAGGAAACA GGAGACACAA CTCCAAGTGC ATACTCTATG 
ATTTGCCATG TGTCCTTTGT CCTCTGTGTT GAGGTTCACG TATGAGATAC 

31351 TCATTTTCAT GGGACTGGTC TGGCCACAAC TACATTAATG AAATATTTGC 
AGTAAAAGTA CCCTGACCAG ACCGGTGTTG ATGTAATTAC TTTATAAACG 

31401 CACATCCTCT TACACTTTTT CATACATT6C CCAAGAATAA AGAATCGTTT 
GTGTAGGAGA ATGTGAAAAA GTATGTAACG GGTTCTTATT TCTTAGCAAA 

31451 GTGTTAT6TT TCAACGT6TT TATTTTTCAA TTGCA6AAAA TTTCAAGTCA 
CACAATACAA AGTTGCACAA ATAAAAAGTT AACGTCTTTT AAAGTTCAGT 

31501 TTTTTCATTC AGTAGTATAG CCCCACCACC ACATAGC7TA TACAGATCAC 
AAAAAGTAAG TCATCATATC GGGGTGGTG6 TCTATCGAAT ATGTCTAGTC 

31551 CGTACCTTAA TCAAACTCAC AGAACCCTAG TATTCAACCT GCCACCTCCC 
GCATGGAATT AGTTTGAGTG TCTTGGGATC ATAAGTTGGA CGGT6GAGGG 

31601 TCCCAACACA CAGA6TACAC AGTCCTTTCT CCCCGGCTGG CCTTAAAAAG 
AGG6TTGTGT GTCTCATGT6 TCAGGAAAGA GGGGCCGACC GGAATTTTTC 

31651 CATCATATCA TGGGTAACAG ACATATTCTT AGGTGTTATA TTCCACACGG 
GTAGTATAGT ACCCATTGTC TGTATAAGAA TCCACAATAT AAGGTGTGCC 

31701 TTTCCTGTCG AGCCAAACGC TCATCAGTGA TATTAATAAA CTCCCCGGGC 
AAAGGACAGC TCGGTTTGCG AGTAGTCACT ATAATTATTT GAGGGGCCCG 

31751- AGCTCACTTA AGTTCATGTC-GCTGTCCAGC-TGCTGAGCCA_CAGGC.TGC.TG 
TCGAGTGAAT TCAkGTACAG CGACAG6TCG ACGACTC6GT GTCCGACGAC 

31B01 TCCAACTTGC GGTTGCTTAA CGGGCG6CGA A6QAGAAGTC CACGCCTACA 
AGGTTGAACG CCAACGAATT GCCCGCCGCT TCCTCTTCAG 5TGCQGATGT 

31851 TGGGGGTA6A 6TCATAATCG TGCATCAGGA TAGGGCGGTG 6TGCTGCAGC 
ACCCCCATCT CAGTATTAGC ACGTAGTCCT ATCCC6CCAC CACGACGTCG 

31901 AGCGCGCGAA TAAACTGCTG CCGCCGCCGC TCCGTCCTGC AGGAATACAA 
TCGCGCGCTT ATTTGACGAC 6GCGGCGQCG AGGCAGGACG TCCTTATGTT 

31951 CATGGCAGTG GTCTCCTCAG CGATGATTCG CACCGCCCGC AGCATAAGGC 
GTACCGTCAC CAGAGGAOTC GCTACTAAGC GTG6CGGGC6 TCGTATTCCG 

32001 GCCTTGTCCT CCSGGCACAG CAGCGCACCC TGATCTCACT TAAATCAGCA 
CGGAACAGGA GGCCCGTGTC GTCGCGTQGG ACTAGAGTGA ATTTAGTCGT 

32051 CA6TAACTGC AGCACAGCAC CACAATATTG TTCAAAATCC CACAGTGCAA 
GTCATTGACG TC6TGTCGTC 6TGTTATAAC AAGTTTXAGG GTGTCACGTT 

32101 GGCGCTGTAT CCAAAGCTCA TGGCGGGGAC CACAGAACCC ACGTGGCCAT 
CCGCGACATA GGTTTCGAGT ACCGCCCCTG GTGTCTTGGG TGCACCGGTA 

32151 CATACCACAA 6C6CAGGTAG ATTAAGTGGC GACCCCTCAT AAACACGCTG 
GTATGGTGTT CGCGTCCATC TAATTCACCG CTGGGGAGTA TTTGTGCGAC 
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32202 
32251 
32301 
32351 
32401 
32451 
32501 
32551 
32601 
32651 
32701 
32751 
32B01 
32851 
32901 
32951 
33001 
33051 
33101 



GACATAAACA T^^CTCTTT TGGCATGTTG TAATTCACCA CCTCCCl^A 
CTGTATTTGT AATGGAGAAA ACCGTACAAC ATTAAGTGGT 66A66GCCAT 

CCATATAAAC CTCTGATTAA ACATGGCQCX; ATCCACCACC ATCCTAAACC 
GGTATATTTG GAGACTAATT TGTACC6CGG TA6GTGGTGG TAOSATTTGG 

AGCTGOCCAA AACCTGCCC3 CCGGCTATAC ACTGCAG6GA ACCGG6ACT6 
TCGACCGGTT TTGGACGQGC GGCOGATATG TGACGTCCCT TGGCCCTGAC 

GAACAATGAC AGTGGA6AGC CCAGGACTCG TAACCATGGA TCATCAT6CT 
dTGTTACTG TCACCTCTCG G6TCCTGAGC ATTGGTACCT AGTAGTACGA 

CGTCATGATA TCAATOTT6G CACAACACA6 GCACACGTGC ATACACTTCC 
GCAGTACTAT A6TTACAACC GTGTTGTGTC CGTGTGCACG TATGT6AAGG 

TCAGGATTAC AAGCTCCTCC CCCGTTAGAA CCATATCCCA GGGAACAACC 
AGTCCTAATG TTCGAGGAGG GCGCAATCn GGTATAGGGT CCCTTGTTGG 

CATTCCTGAA TCAGCGTAAA TCCCACACTG CAGGGAAGAC CTCGCACGTA 
GTAAGGACTT AGrCGCATTT A6GGTGTGAC GTCCCTTCT6 GAGCGT6CAT 

ACTCACGTT6 TGCATTGTCA AAGTGTTACA TTC6G6CAGC AGCGGATGAT 
TGAGTGCAAC ACGTAACAGT TTCACAATGT AAGCCCGTCG TCGCCTACTA 

CCTCCAGTAT GGTAGCGCGG GTTTCTGTCT CAAAAGGAGG TAGACGATCC 
GGAGGTCATA CCATCGCGCC CAAAGACAGA GTTTTCCTCC ATCTGCTAGG 

CTACT6TACG GAGTGCGCXIG AGACAACCGA GATCGTGTTG 3TCGTAGTGT 
GATGACATGC CTCACGCGGC TCTGTTG3CT CTAGCACAAC CAGCATCACA 

CATGCCAAAT GGAAC6CCGG ACGTAGTCAT ATTTCCTGAA GCAAAACCAG 
GTACGGTTTA CCTTGCGGCC TGCATCAGTA TAAAGGACTT CGTTTTGGTC 

GTGCGGGCGT GACAAACAGA TCTGCGTCTC CGGTCTCGCC GCTTAGATC5 
CACGCCCGCA CTGTTTGTCT AGACGCA3AG GCCAGAGCGG CGAATCTAGC 

CTCTGTGTAG TAGTTGTAGT ATATCCACTC TCTCAAAGCA TCCAGGCGCC 
GAGACACATC ATCAACATCA TATAGGTGAG AGAGTTTCGT AGGTCCGCGG 

CCCTGGCTTC GGGTTCTATG TAAACTCCTT CATGCGCCGC TGCCCTGATA 
GGGACCGAAG CCCAAGATAC ATTTGAGSAA GTACGC6GCG ACGGGACTAT 

ACATCCACCA CCGCAGAATA AGCCACACCC AGCCAACCTA CACATTCGTT 
TGTAGGTGGT GGCGTCTTAT TCGGTGTGGG TCGGTTGGAT GTGTAAGCAA 

CTGCGAGTCA CACACGGGAG GAGCGGGAAO AGCTGGAAGA ACCATGTTTT 
GACGCTCAGT GTGTGCCCTC CTCGCCCTTC TCGACCTTCT TG6TACAAAA 

TTTTTTTATT CCAAAAGATT ATCCAAAACC TCAAAATGAA GATCTATTAA 
AAAAAAATAA GGTTTTCTAA TAGGTTTTGG AGTTTTACTT CTAGATAATT 

GTGAACGCGC TCCCCTCCGG TGGCGTGGTC AAACTCTACA GCCAAAGAAC 
CACTTGCGCG AGGGGAGGCC ACCGCACCAG TTT6AGATGT CGGTTTCTTG 

AGATAATGGC ATTTGTAAGA TGTTGCACAA TGGCTTCCAA AAGGCAAACG 
TCTATTACCG TAAACATTCT ACAACGTGTT ACCGAAGGTT TTCCGTTTGC 
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33151 
33201 
33251 
33301 
33351 
33401 
33451 
3350X 
33551 
33601 
33651 
33701 
33751 
33801 
33B51 
33901 
33951 
34001 
34051 



CGGGAGTGCA ^■tCACCTG CATTTCCGAT TTGGGAAGTC CCACTIfJBRG 

CTCTATAAAC ATTCCAGCAC CTTCAACCAT GCCCAAATAA TTCTCATCTC 
GAGATATTTG TAAGGTCGTG GAAGTTGGTA CGGGTTTATT XAC3AGTAGAG 

6CCACCTTCT CAATATATCT CTAAGCAAAT CCCGAATATT AASTCCGGCC 
CGGTGGAAGA GTTATATAGA GATTCGTTTA 6GGCTTATAA TTCA6GCCGG 

ATTGTAAAAA TCTGCTCCAG AGCGCCCTCC ACCTTCAGCC TCAAGCAGCG 
TAACATTTTT AGACGAGGTC TCGCGGGAGG TGGAAGTCGG AGTTCGTCGC 

AATCATGATT QCAAAAATTC AGGTTCCTCA CAGACCTGTA TAAGATTCAA 
TTAGTACTAA CGTTTTTAA6 TCCAAGGAGT 6TCTGGACAT ATTCTAAGTT 

AAGCGGAACA TTAACAAAAA TACCGCGATC CC6TAGGTCC CTTCGCAGGG 
TTCOCCTTGT AATTGTTTTT ATOQCGCTAG GGCATCCAGG GAACCCTCCC 

CCAGCTGAAC ATAATCGTGC AGGTCT6CAC 6GACCAGCGC GQCCACTTCC 
GGTCGACTTG TATTAGCACG TCCAGACGTG CCTGGTCGCG COGGTGAAGG 

CC6CCAGGAA CCATGACAAA AGAACCCACA CTGATTATGA CACGCATACT 
GGCGGTCCTT GGTACTGTTT TCTTGGGTGT GACTAATACT GTGCGTATGA 

CGGAGCTATG CTAACCAQCG TAGCCCCGAT GTAAGCTTGT TGCATGGGCG 
GCCTCGATAC 6ATTGGTCGC .ATCGGGGCTA CATTCGAACA ACGTACCCGC 

GCGATATAAA ATGCAAGGTG CTGCTCAiAA AATCAGGCAA AGCCTCGCGC 
CGCTATATTT TACGTTCCAC GACGAGTTTT TTAGTCCGTT TCG6AGCGCG 

AAAAAAGAAA GCACATCGTA GTCATGCTCA TGCAGATAAA GGCAGGTAAG 
TTTTTTCTTT CGTSTAG CAT C AGTACG AGT ACGTCTATTT CCGTCCATTC 

CTCCGGAACC ACCACAGAAA AAGACACCAT TTTTCTCTCA AACATGTCTG 
GAGGCCTTG6 TQQTGTCTTT TTCTGTG3TA AAAAGAGAGT TTGTACAGAC 

CGGGTTTCTG CATAAACACA AAATAAAATA ACAAAAAAAC ATTTAAACAT 
GCCCAAAGAC GTATTTGTGT TTTATTTTAT TGTTTTTTTG TAAATTTGTA 

TAGAAGCCTG TCTTACAACA CGAAAAACAA CCCTTATAAG CATAAGACGG 
ATCTTCGGAC AGAATGTTGT CCTTTTTGTT GGGAATATTC GTATTCTGCC 

ACrACGGCCA TGCCGGCGTG ACCGTAAAAA AACTGGTCAC CGTGATTAAA 
TGATQCCGGT ACOGCCGCAC TGGCATTTTT TTGACCAGTG GCACTAATTT 

AAGCACCACC GACAGCTCCT CGGTCATGTC CGGAGTCATA ATGTAAGACT 
TTCGTGGTGG CTGTTCGAGGA GCCAGTACAG GCCTCAGTAT TACATTCT6A 

CGGTAAACAC ATCAGGTTGA TTCACATCGG TCAGTGCTAA AAAGCGACCG 
GCCATTTGTG TAGTCCAACT AAGTGTAGCC AGTCACGATT TTTCGCTGGC 

AAATAGCCCG GGG6AATACA TACXICGCAGG CGTAGAGACA ACATTACAGC 
TTTATCG6GC CCCCTTAT6T ATGGGCGTCC GCATCTCTGT TGTAATGTCG 

CCCCATAGGA GGTATAACAA AATTAATAG6 AGAGAAAAAC ACATAAACAC 
GGGGTATCCT CCATATTGTT TTAATTATCC TCTCTTTTTG TGTATTTGTG 
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3410X 
34151 
34201 
34251 
34301 
34351 
344D1 
34451 
34501 

34551 
34601 
34S51 
34701 
34751 
34801 
34B51 
34901 
34951 
35001 



CrGAAAAACCiCMi.TGCCTA GGCAAAATAG CACCCTCCG«e'«6TG»i^J«A 
GACTTTTTGG GJ^BACGGAT CCGTTTTATC QTGGGAGGGC GAGGTC1BBT 

ACATACAGCG CTTCCACAQC GGCAGCCATA ACAGTCAGCC TTACCAGTAA 
TGTATGTCGC GAAGGTGTCG CCGTCGGTAT TGTCAGTCGG AATGGTCATT 

AAAAGAAAAC CTATTAAAAA AACACCACTC GACACGGCAC CA6CTCAATC 
TTTTCTTTTG GATAATTTTT TTGTGGTGAG CTGTGCCQTO OTCGAGTTAQ 

AGTCACAGTG TAAAAAAGGG CCAA6TGCAG AGCGAGTATA TATAGGACTA 
TCAGTGTCAC ATTTTTTCCC GGTTCAC6TC TCCCTCATAT ATATCCTGAT 

AAAAATGACG TAACGGTTAA AGTCCACAAA AAACACCCAG AAAACCGCAC 
TTTTTACTGC ATTGCCAATT TCAGGTCTTT TTTGTGG6TC TTTTGGCGTG 

GCGAACCTAC GCCCAGAAAC 6AAAGCCAAA AAACCCACAA CTTCCTCAA& 
CGCTTGGATG CGGGTCTTTG CTTTCGCTTT TTTGGGTCTT GAAGGAGTTT 

TCGTCACTTC CGTTTTCCCA CGTTACGTCA CTTCCCATTT 7AAGAAAACT 
AGCAGTGAAG GCAAAAOGGT GCAATGCA6T 6AAGGGTAAA ATTCTTTTGA 

ACAATTCCCA ACACATACAA GTTACTCCGC CCTAAAACCT AC3TCACCCG 
TGTTAAGGGT TGTGTATGTT CAATGAOGCG GGATTTT6GA TGCAGTGGGC 

CCCCGTTCCC ACGCCCC6CG CCACCTCACA AACTCCACCC CCTCATTATC 
GGGGCAAGGG TGCGGGGCGC GGTGCAGT6T TT6AGGTGGG GGAGTAATAG 



ATATTGGCTT CAATCCAAAA TAAGGTATAT TATTGATGAT GTTAATTAAG 
TATAACCGAA GTTAGGTTTT ATTCCATATA ATAACTACTA CAATTAATTC 

AATTCGGATC TGCGAC6CGA GGCTGGATGG CCTTCCCCAT TATGATTCTT 
TTAAGCCTAG ACGCTGCGCT CCGACCTACC GGAAGGGGTA ATACTAAGAA 

CTCGCTTCCG GC6GCATCGG 6ATGCCCGCG TTGCAGGCCA rGCTGTCCAG 
GAGCGAAGGC CGCCGTAGCC CTACG6GCGC AACGTCCGGT ACGACAGGTC 

GCAGGTAGAT GACGACCATC AGGGACAGCT TCAAGGCCAG CAAAAGGCCA 
CGTCCATCTA CTGCTGGTAG TCCCTGTCGA AGTTCCGGTC GTTTTCCGGT 

GGAACCGTAA AAAGGCCGCG TTGCTGGCGT TTTTCCATAG GCTCCGCCCC 
CCTTGGCATT TTTOCGGCGC AACGACCGCA AAAAGGTATC CCAGGCGGGG 

-CCTGACGAGC ATCACAAAAA TCGACGCTCA AGTCAGAGGT GGCGAAACCC 
GGACTGCTCG TA6TGTTTTT AGCTGC6AGT TCAGTCTCCA CCGCTTTGGG 

GACAG6ACTA TAAAGATACC AGGCGTTTCC CCCTGGAAGC TCCCTCGTGC 
CT6TCCTGAT ATTXCTATGG TCCQCAAAGG GGGACCTTCC ACCGAGCACC 

GCTCTCCTGT TCCGACCCTG CCGCTTACCG GATACCTGTC CGCCTTTCTC 
C6AGAGGACA AGGCTGG6AC GGCGAATGGC CTATGGACAG GCGGAAAGAG 

CCTTCGGGAA GCGTGGCGCT TTCTCATAGC TCACGCTGTA GGTATCTCAG 
GGAAGCCCTT C6CACCGCGA AA6AGTATCG AGTGCGACAT CCATAGAGTC 

TTCGGTGTAG GTCGTTCGCT CCAAOCTGGG CTGTGTGCAC GAACCCCCCG 
AAGCCACATC CAGCAAGCGA G6TTCGACCC 6ACACACGTG CTTGGGGGGC 
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35051 TTCAGCCCGA CCGCTGCGCC TTATCCGGTA ACTATCGTCT TGAGTCCAAC 
AAGTCGGGCT GGCGACGCGG AATAGGCCAT TGATAGCAGA ACTCAGGTTG 

35101 CCGGTAAGAC ACGACTTATC GCCACTGGCA GCAGCCACTG GTAACAGGAT 
GGCCATTCTG TGCTGAATAG CGGTGACCGT CGTCGGTGAC CATTGTCCTA 

35151 TAGCAGA6CG AGGTATCTAG GCG6TGCTAC AGAGTTCTTG AAGTGGTGGC 
ATCGTCTCGC TCCATACATC CGCCAC6ATG TCTCAAGAAC TTCACCACCG 

35201 CTAACTACGG CTACACTAGA ASGACAGTAT TTGGTATCTG CGCTCTGCTG 
GATTGATGCC GATGTGATCT TCCTGTCATA AACCATAGAC 6CGA6ACGAC 

35251 AAGCCAGTTA CCTTCGGAAA AAGA6TTGGT A6CTCTTGAT CCGGCAAACA 
TTCGGTCAAT GGAAGCCTTT TTCTCAACCA TCGAGAACTA G6CCGTTTGT • 

35301 AACCACCGCT GOTAGCGGTG GTTTTTTTGT TTGCAAGCAG CAGATTAC6C 
TTGGTGGC6A CCATCGCCAC CAAAAAAACA AACGTTCGTC GTCTAATGCG 

35351 GCAGAAAAAA AGGATCTCAA GAAGATCCTT TGATCTTTTC TACGGGGTCT 
CGTCTTTTTT TCCTAGAGTT CTTCTAGGAA ACTAGAAAAG ATGCCCCAGA 

35401 GACGCTCAGT GGAACGAAAA CTCACGTTAA GGGATTTTGG TCATGAGATT 
CTGCGAGTCA CCTTGCTTTT GAGTGCAATT CCCTAAAACC AGTACTCTAA 

35451 ATCAAAAAGG ATCTTCACCT AGATCCTTTT AAATCAATCT AAAGTATATA 
TAGTTTTTCC TAGAAGTGGA TCTAGGAAAA TTTAGTTAGA TTTCATATAT 

35501 TGAGTAAACT TGGTCTGACA GTTACCAATG CTTAATCAGT GAGGCACCTA 
ACTCATTTGA ACCAGACTGT CAATGGTTAC GAATTAGTCA CTCCGTGGAT 

35551" TCTCAGCGAT CTGTCTATfT CGTTCATCCA TAGTTGCCTG ACTCCCCGTC 
AGAGTCGCTA GACAGATAAA GCAAGTAGGT ATCAACGGAC TGAGGGGCAG 

35601 GTGTA6ATAA CTACGATACG GGAGGGCTTA CCATCTGGCC CCAGTGCTGC 
CACATCTATT GATGCTATGC CCTCCCGAAT GGTAGACCGG GGTCACGACG 

35651 AATGATACCG CGAGACCCAC GCTCACCGGC TCCAGATTTA TCAGCAATAA 
TTACTATGGC GCTCTGG6TG CGAGTGGCCG AGGTCTAAAT AGTCGTTATT 

35701 ACCAGCCAGC CGGAAGGGCC^ GAGCGCAGAA GTGGTCCTGC AACTTTATCC 
TC6TCGGTCG GCCTTCCCGG CTCGCGTCTT CACCAGGACG TTGAAATAGG 

35751 GCCTCCATCC AGTCTATTAA TTGTTGCCGG GAAGCTAGAG TAAGTAGTTC 
COGAGGTAGG TCAfiATAATT AACAACGGCC CTTCGATCTC ATTCATCAAG 

35801 GCCAGTTAAT AGTTTGC6CA ACGTTGTTGC CATTGCTACA G6CATCGTGG 
CGGTCAATTA TCAAACGCGT TGCAACAACG GTAACGATGT CCGTAGCACC 

35B51 TGTCACGCTC GTC6TTTGGT AT6GCTTCAT TCAGCTCCGG TTCCCAACGA 
ACAGTGCGAG CAGCAAACCA TACCGAAGTA AGTCGAGGCC AAGGGTTGCT 

35901 TCAAGGCGAG TTACATGATC CCCCATGTTG TGCAAAAAAG CGGTTAGCTC 
AGTTCCGCTC AATGTACTAG GGGGTACAAC ACGTTTTTTC QCCAATCGAG 

35951 CTTCGGTCCT CCGATCGTTG TCAGAAGTAA GTTGGCCGCA GTGTTATCAC 
GAAGCCAGGA GGCTAGCAAC AGTCTTCATT CAACCGGCGT CACAATAGTG 
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36001 TCATGGTTAT 'IBBiGCACTG CATAATTCTC TTACTGTCAT GCCAT^PtA 
AGTACCAATA CCOTCGTGAC GTATTAAGAG AATGACAGTA CGGTAGGCAT 

36051 AGATGCTTTT CTGTGACTGG TGAGTACTCA ACCAA6TCAT, TCTG7U3AATA 
TCTACGAAAA GACACTGACC ACTCAT6AGT TGGTTCAGTA AGACTCTTAT 

36101 GTGTATGCG6 CGACCGAGTT GCTCTTGCCC GGCGTCAACA CGGGATAATA 
CACATACGCC GCTGGCTCAA CGAGAACGGG CCGCAGTTGT GCCCTATTAT 

36151 CCGCGCCACA TAGCAGAACT TTAAAAGTGC TCATCATTGG AAAAC6TTCT 
GGCGCGGTGT ATCGTCTTGA AATTTTCACG AGTAGTAACC TTTTGCAAGA 

36201 TCGGGGCGAA AACTCTCAAG GATCTTACCG CTGTT6AGAT CCAGTTCGAT 
AGCCCCGCTT TTGAGAGTTC CTAGAATGGC GACAACTCTA GGTCAAGCTA 

36251 GTAACCCACT CGTGCACCCA ACTGATCTTC AGCATCTTTT ACTTTCACCA 
CATTGGGTGA GCACGTGGGT TGACTAGAAG TCGTAGAAAA TGAAAGTGGT 

36301 ■ GCGTTTCTGG GTGAGCAAAA ACAGGAAGGC AAAATGCCGC AAAAAAGGGA 
CGCAAAGACC CACTCGTTTT TGTCCTTCCG TTTTACGGCG TTTTTTCCCT 

36351 ATAAGGGCGA CACGGAAATG TTGAATACTC ATACTCTTCC TTTTTCAATA 
TATTCCCGCT GTGCCTTTAC AACTTATGAG TATGAGAAGG AAAAAGTTAT 

36401 TTATTGAAGC ATTTATCAGG GTTATTGTCT CA-rGAGCGGA TACATATTTG 
AATAACTTCG TAAATAGTCC CAATAACAGA GTACTCGCCT ATGTATAAAC 

36451 AATGTATTTA GAAAAATAAA CAAATAGGGG TTCCGCGCAC ATTTCCCCGA 
TTACATAAAT CTTTTTATTT 6TTTATCCCC AAGGCGCGTG TAAAGGGGCT 

36501 AAAGTGCCAC CTGACGTCTA AGAAACCATT ATTATCATGA CATTAACCTA 
TTTCACGGTG GACTGCAGAT TCTTTGGTAA TAATA6TACT GTAATTGGAT 

36551 TAAAAATAGG CGTATCACGA GGCCCTTTCG TCTTCAAGAA TTGGATCCGA 
ATTTTTATCC GCATAGTGCT CCGGGAAAGC AGAAGTTCTT AACCTAGGCT 

Pad 



36601 ATTCTTAATT TCTTAATTAA (SEQ ID NO: 34) 
TAAGTATTAA AGAATTAATT (SEQ ID NO: 35) 
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atg 
Met 
1 


gat gca atg aag 
Asp Ala Met Lys 

5 


aga ggg etc tgc 
Arg Gly Leu Cys 


tgt 
Cys 
10 


gtg 
Val 


ctg 
Leu 


ctg 
Leu 


ctg 
Leu 


tgt 
Cys 
15 


gga 

Gly 


48 


gca 
Ala 


gtc 
Val 


ttc 
Phe 


gtt 

Val 
20 


teg 
Ser 


ccc 
Pro 


age 
Ser 


gag 
Glu 


ate 
He 
25 


tec 
Ser 


att 
He 


gtg 
Val 


tgg gee 
Trp Ala 
30 


tee 
Ser 


agg 
Arg 


96 


gag 
61u 


ctg gag agg ttt 
Leu 6lu Arg Phe 
35 


get gtg 
Ala Val 


aac 
Asn 
40 


cct 
Pro 


ggc 
Gly 


ctg 
Leu 


ctg 
Leu 


gag acc 
Glu Thr 
45 


tct 
Ser 


gag 
Glu 


144 


ggg 
Gly 


tgc 
Cys 
50 


agg 
Arg 


cag 
Gin 


ate 
He 


ctg ggc cag etc 
Leu Gly Gin Leu 
55 


cag 
Gin 


ccc 
Pro 


tec 
Ser 
60 


ctg 
Leu 


caa 
Gin 


aca 
Thr 


ggc 

Gly 


192 


tct 
Ser 
65 


gag gag ctg agg 
Glu Glu Leu Arg 


tee 
Ser 
70 


ctg 
Leu 


tac 
Tyr 


aac 
Asn 

• 


aca 
Thr 


gtg 
Val 
75 


get 
Ala 


ace ctg 
Thr Leu 


tac 
Tyr 


tgt 
Cys 
80 


240 


gtg 
Val 


cac 
His 


cag aag 
Gin Lys 


att 
He 
85 


gat gtg aag gac 
Asp Val Lys Asp 


acc 
Thr 
90 


aag 
Lys 


gag 

Glu 


gee 
Ala 


ctg 
Leu 


gag 

Glu 
95 


aag 
Lys 


288 


att 
He 


gag 
Glu 


gag 
Glu 


gag 
Glu 
100 


cag 
Gin 


aac aag 
Asn Lys 

* 


tec 
Ser 


aag 
Lys 
105 


aag 
Lys 


aag 
Lys 


gee 
Ala 


cag 
Gin 


cag 
Gin 
110 


get 
Ala 


get 
Ala 


336 


get 
Ala 


ggc aca ggc aac 
Gly Thr Gly Asn 
115 


tec 
Ser 


age 
Ser 


cag 
Gin 
120 


gtg 
Val 


tec 
Ser 


cag 
Gin 


aac 
Asn 


tac 
Tyr 
125 


.ccc 
Pro 


att 
He 


gtg 
Val 


384 


cag 
Gin 


aac etc cag ggc 

Asn -I*eu Gin 61y_ 
130 


cag 

_Gln_ 


atg 

_Met . 
135 


gtg 
^Val 


cac 

..His. 

• 


cag 
.Gin 


gee 
Al.a_ 


ate 
He 
140 


tee 
Ser 


ccc 
P^ 


egg 
Arg 


ace 
Thr 


432 


ctg 
Leu 
145 


aat 
Asn 


gcc 
Ala 


tgg 
Trp 


gtg 
Val 


aag 
Lys 
150 


gtg 
Val 


gtg 
Val 


gag 
Glu 


gag 
Glu 


aag 
Lys 
155 


gcc 
Ala 


ttc 
Phe 


tec 
Ser 


cct 
Pro 


gag 
Glu 
160 


480 


gtg 

Val 


ate 
He 


ccc 
Pro 


atg 
Met 


ttc 
Phe 
165 


tct 
Ser 


gee 
Ala 


ctg 
Leu 


tct 
Ser 


gag 
Glu 
170 


ggt 

Gly 


gcc 
Ala 


acc 
Thr 


ccc 
Pro 


cag 
Gin 
175 


gac 
Asp 


528 


ctg 
Leu 


aac 
Asn 


acc 
Thr 


atg 

Met 
160 


ctg 
Leu 


aac 
Asn 


aca gtg 
Thr Val 


ggg 

Gly 
185 


ggc 

Gly 


eat 
His 


cag 
Gin 


get 
Ala 


gee 
Ala 
190 


atg 
Met 


cag 
Gin 


576 


atg 
Idet 


ctg 
Leu 


aag 
Lys 
195 


gag 
Glu 


acc 
Thr 


ate 
He 


aat 
Asn 


gag 
Glu 
200 


gag 
Glu 


get 
Ala 


get 
Ala 


gag 
Glu 


tgg gac 
Trp Asp 
205 


agg 
Arg 


ctg 
Leu 


624 


cat 
His 


cct 
Pro 
210 


gtg 
Val 


cac 
His 


get 
Ala 


ggc ccc att 
Gly Pro He 
215 


gcc 
Ala 


ccc 
Pro 


ggc 
Gly 


cag 
Gin 


atg agg 
Met Arg 


gag 
Glu 


ccc 
Pro 


672 


agg 
Arg 
225 


ggc tct gac att 
Gly Ser Asp He 


get 
Ala 
230 


ggc acc 
Gly Thr 


acc 
Thr 


tec 
Ser 


acc 
Thr 
235 


etc 
Leu 


cag 
Gin 


gag 
Glu 


cag 
Gin 


att 
He 
240 


720 


ggc 
Gly 


tgg atg 
Trp Met 


acc aac 
Thr Asn 
245 


aac 
Asn 


ccc 
Pro 


ccc 
Pro 


ate 
He 


cct 
Pro 
250 


gtg 

Val 


ggg 

Gly 


gaa 
Glu 


ate 
He 


tac 
Tyr 
255 


aag 
Lys 


768 
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agg tgg ate ate ctg ggc ctg aac aag att gtg agg atg tac tec ccc 816 

Arg Trp lie He Leu Gly Leu Asn Lys He Val Arg Met Tyr Ser Pro 

260 265 270 

acc tec ate ctg gac ate agg eag gge ccc aag gag ccc ttc agg gac 864 

Thr Ser He Leu Asp He Arg Gin Gly Pro Lys Glu Pro Phe Arg Asp 
275 280 285 

tat gtg gae agg ttc tac aag acc ctg agg get gag cag gcc tec eag 912 

Tyr Val Asp Arg Phe Tyr Lys 'Hhr Leu Arg Ala Glu Gin Ala Ser Gin 
290 295 300 

gag gtg aag aac tgg atg aca gag acc ctg ctg gtg cag aat gee aac 960 

Glu Val Lys Asn Trp Met Thr Glu Thr Leu Leu Val Gin Asn Ala Asn 

305 310 315 320 

cct gac tgc aag acc ate ctg aag gcc ctg ggc cct get gcc acc ctg 1008 

Pro Asp Cys Lys Thr He Leu Lys Ala Leu Gly Pro Ala Ala Thr Leu 

325 330 335 



gag gag atg atg aca gcc tgc cag ggg gtg ggg ggc cct ggt cac aag 
Glu Glu Met Met Thr Ala Cys Gin Gly Val Gly Gly Pro Gly His Lys 

340 345 350 



1056 



gcc agg gtg ctg get gag gcc atg tec cag gtg acc aac tec gcc acc 1104 
Ala Arg Val Leu Ala Glu Ala Met Ser Gin Val Thr Asn Ser Ala Thr 
355 360 365 

ate atg atg cag agg ggc aac ttc agg aac cag agg aag aca gtg aag 1152 
He Met Met Gin Arg Gly Asn Phe Arg Asn Gin Arg Lys Thr Val Lys 
370 375 380 

tgc ttc aac tgt ggc aag gtg ggc cac att gcc aag aac tgt agg gcc 1200 
Cys Phe Asn Cys Gly Lys Val Gly His He Ala Lys Asn Cys Arg Ala 
385 390 395 400 

ccc agg aag aag ggc tgc tgg aag tgt ggc aag gag ggc cac cag atg 1248 
^ro Arg Lys Lys Gly Cys Trp Lys Cys Gly Lys Glu Gly His Gin Met 

405 410 415 

aag gac tgc aat gag agg eag gcc aac ttc ctg ggc aaa ate tgg ccc 1296 
Lys Asp Cys Asn Glu Arg Gin Ala Asn Phe Leu Gly Lys He Trp Pro 

420 425 430 

tee cac aag gge agg cct ggc aac ttc etc cag tec agg cct gag ccc 1344 
Ser His Lys Gly Arg Pro Gly Asn Phe Leu Gin Ser Arg Pro Glu Pro 
435 440 445 

aca gcc cct ccc gag gag tec ttc agg ttt ggg gag gag aag acc acc 1392 
Thr Ala Pro Pro Glu Glu Ser Phe Arg Phe Gly Glu Glu Lys Thr Thr 
450 455 460 

ccc age cag aag cag gag ccc att gac aag gag ctg tac ccc ctg gcc 1440 
Pro Ser Gin Lys Gin Glu Pro He Asp Lys Glu Leu Tyr Pro Leu Ala 
465 470 475 480 

tec ctg agg tec ctg ttt ggc aac gac ccc tee tec cag taa (SID NO: 36) 1482 
Ser Leu Arg Ser Leu Phe Gly Asn Asp Pro Ser Ser Gin * (SID NO: 37) 

485 490 
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FIGURE 33A 

ATGGGTGCTA GGGCTTCTGT GCTGTCTGGT GGTGAGCT6G ACAAGTGG6A GAAGATCAGG 
CTGAGGCCTG GTGGCAASAA GAAGTACAAG CTAAAGCACA TTGTGTGGGC CTCCAGGGAG 
CTGGAGAGGT TTGCTGTGAA CCCTGGCCTG CTGGAGACCT CTQAGGQ6TG CAGGCAGATC 
CTGGGCCAGC TCCAGCCCTC CCTGCAAACA GGCTCTGAGG AGCTGAGGTC CCTGTACAAC 
ACAGTGGCTA CCCTGTACT6 TGTGCACCA6 AAGATTGATG T6AAG6ACAC CAAGGAGGCC 
CTGGAGAAGA TTGAGGAGGA GCAGAACAAG TCCAAfiAAGA AGGCCCAGCA GGCTGCTGCT 
GGCACAGGCA ACTCCAGCCA GGTGTCCCAG AACTACCCCA TTGTGCA6AA CCTCCAGGGC 
CAGATG6TGC ACCAGGCCAT CTCCCCCCGG ACCCTGAAT6 CCTGGGTSAA GGTGGTGGAG 
GA6AAGGCCT TCTCCCCTGA GGTGATCCCC ATGTTCTCTG CCCTGTCTGA GGGTGCCACC 
CCCCAGGACC TGAACACCAT GCT6AACACA GTGGGGGGCC ATCAGGCTGC CATGCAGATG 
CTGAAGGAGA CCATCAATGA GGAGGCTGCT GAGTGGGACA GGCT6CATCC TGTGCACGCT 
GGCCCCATTG CCCCCGGCCA GATGAGGGAG CCCAGGGGCT CT6ACATTGC TGGCACCACC 
TCCACCCTCC AG6AGCAGAT TGGCTGGATG ACCAACAACC CCCCCATCCC TGTGGGGGAA 
ATCTACAAGA GGTGGATCAT CCTGGGCCTG AACAA6ATTG TGAGGATGTA CTCCCCCACC 
TCCATCCTGG ACATCAGGCA GGGCCCCAAG GAGCCCTTCA GGGACTATGT GGACAG6TTC 
TACAAGACCC TGAGGGCTGA GCAGGCCTCC CAGGAGGTGA AGAACTGGAT GACAGAGACC 
CTGCTGGTGC AGAATGCCAA CCCTGACTGC AAGACCATCC TGAAGGCCCT GGGCCCTGCT 
GCCACCCTGG AGGAGATGAT GACAGCCTGC CAGGGGGT6G GGGGCCCTGG TCACAAGGCC 
AGGGTGCTGG CTGAGGCCAT GTCCCAGGTG ACCAACTCCG CCACCATCAT GATGCAGAGG 
6GCAACTTCA~GGAACCAGAG"GAAGACAGTG AAGTGCTTCA ACTGTGGGAA- GGTGGGGGAG- 
ATTGCCAAGA ACTGTAGGGC CCCCAG6AAG AAGGGCTGCT GGAAGTGTGG CAAGGAGGGC 
CACCAGATGA AGGACTGCAA TGAGAGGCA6 6CCAACTTCC TGGGCAAAAT CTGGCCCTCC 
CACAAGGGCA GGCCTGGCAA CTTCCTCCAG TCCAGGCCTG AGCCCACAGC CCCTCCCGAG 
GAGTCCTTCA GGTTTGGQGA GGAGAAGACC ACCCCCAGCC AGAAGCAGGA GCCCATTGAC 
AAGGAGCTGT ACCCCCTGGC CTCCCTGAGG TCCCTGTTTG GCAACGACCC CTCCTCCCAG 
ATGGCTCCCA TCTCCCCCAT TGAGACTGTG CCTGTGAAGC TGAAGCCTGG CATGGATGGC 
CCCAAGGTCA AGCAGTGGCC CCTGACTGAG GAGAAGATCA AGGCCCTGGT GGAAATCTGC 
ACTGAGATGG AGAAGGAGGG CAAAATCTCC AAGATTG6CC CCGAGAACCC CTACAACACC 
CCTGT6TTTG CCATCAAGAA GAAGGACTCC ACCAAGTSGA GGAAGCTGGT GGACTTCAGG 
GAGCTGAACA AGAGGACCCA G6ACTTCTGG GAGGTGCAGC TGGGCATCCC CCACCCCGCT 
GGCCTGAAGA AGAAGAAGTC TGTGACTGTG CTGGCTGTGG GGGATGCCTA CTTCTCTGT6 
CCCCTGGAT6 AG6ACTTCAG GAAGTACACT GCCTTCACCA TCCCCTCCAT CAACAATGAG 
ACCCCTCGCA TCAGGTACCA GTACAATGTG CTGCCCCAGG GCTCGAAGGG CTCCCCT6CC 
ATCTTCCAGT CCTCCATGAC CAAGATCCTG GAGCCCTTCA GGAAGCA6AA CCCTGACATT 
GT6ATCTACC ACTACATG6C TGCCCTGTAT GTGGGCTCTG ACCTGGAGAT TGGGCAGCAC 
AGGACCAAGA TTGAGGAGCT GAG6CAGCAC CTGCTCAGGT GGGGCCTSAC CACCCCTGAC 
AA6AAGCACC AGAASGAGCC CCCCTTCCTG TGGATGGGCT ATGA6CTGCA CCCCGACAA6 
TGGACTGTGC AGCCCATTCT GCTGCCTGAG AAGGACTCCT GGACTGTGAA TGACATCCAG 
AAGCTG6TGG GCAA6CTGAA CTGGGCCTCC CAAATCTACC CTGGCATCAA 6GTGAGGCAG 
CTGTGCAAGC TGCTGAGGGG CACCAAGGCC CTGACTGAGG TGATCCCCCT GACTGAGGAG 
GCTGAGCTGG AGCTGGCTGA GAACAGGGAG ATCCTGAAGG AGCCTGTGCA TGGGGTGTAC 
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FIGURB 33B 

TATGACCCCT CCAAGGACCT GATTGCTGAG ATCCAGAAGC AGGGCCAGGG CCAGTGGACC 
TACCAAATCT ACCAGGAGCC CTTCAAGAAC CTGAAGACTG GCAAGTATGC CAGGATGAGG 
GGGGCCCACA CCAATGATGT GAAGCAGCTG ACTGAGGCTG TGCAGAAGAT CACCACTGAG 
TCCATTGTGA TCTGGGGCAA GACCCCCAAG TTCAAGCTGC CCATCCAGAA GGAGACCTGG 
GAGACCTGGT GGACTGAGTA CTGGCAGGCC ACCTGGATCC CT6AGTGGGA GTTTGTGAAC 
ACCCCCCCCC TGGTGAAGCT GTGGTACCAG . CTGGAGAAGG AGCCCATTGT GGGGGCTGAG 
ACCTTCTATG TGGCTGGGGC TGCCAACA6G GAGACCAAGC TGGGCAAGGC TG6CTATGTG 
ACCAACAGGG GCAGGCAGAA GGTGGTGACC CTGACTGACA CCACCAACCA GA^GACTGCC 
CTCCAGGCCA TCTACCTGGC CCTCCAGGAC TCTGGCCTGG AGGTGAACAT TGTGACTGCC 
TCCCA6TATG CCCT6GGCAT CATCCAG6CC CAGCCTGATC AGTCTGAGTC TGAGCTGGTG 
AACCAGATCA TTGAGCAGCT GATCAAGAAG GAGAAGGTGT ACCTGGCCTG GGTGCCTGCC 
CACAAGGGCA TTGGGGGCAA TGAGCAGGTG GACAAGCTGG TGTCTGCTGG CATCAGGAAG 
GTGCT6TTCC TGGATGGCAT T6ACAAGGCC CAGGATGAGC AT6AGAAGTA CCACTCCAAC 
TGGAGGGCTA TGGCCTCTGA CTTCAACCTG CCCCCTGTGG TGGCTAAGGA GATTGTGGCC 
TCCT6TGACA AGTGCCAGCT GAAGGGGGAG GCCATGCATG GGCAGGTGGA CTGCTCCCCT 
GGCATCTGGC AGCTGGCCTG CACCCACCTG GAGGGCAAGG TGATCCTGGT GGCTGTGCAT 
GTGGCCTCCG GCTACATTGA GGCTGAGGTG ATCCCTGCTG AGACAGGCCA GGAGACTGCC 
TACTTCCTGC TGAAGCTGGC TGGCAGGTGG CCTGTGAAGA CCATCCACAC TGCCAATGGC 
TCCAACTTCA CTGGGGCCAC AGT6AGG6CT GCCTGCTGGT GGGCTGGCAT CAAGCAGGAG 
TTTGGCATCC CCTACAACCC CCAGTCCCAG GGGGTGGTGG CCTCCATGAA CAAG6AGCTG 
AAGAAGATCA TTGGGCAGGT GAGGGACCAG GCTGAGCACC TGAAGACA6C TGTGCAGATG 
GCTGTGTTCA TCCACAACTT CAAGAGGAAG GGGGGCATCG GGGGCTACTC CGCTGGGGAG 
AGGATTGTGG ACATCATTGC CACAGACATC CAGACCAAGG AGCTCCAGAA GCAGATCACC 
AAGATCCAGA ACTTCAGGGT GTACTACAGG GACTCCAGGA ACCCCCTGTG GAAGGGCCCT 
GCCAAGCTGC TGTGGAAGGG GGAGGGGGCT GTGGTGATCC AGGACAACTC TGACATCAAG 
GTGGTGCCCA GGAGGAAGGC CAAGATCATC AGGGACTATG GCAAGCAGAT GGCTGGGGAT 
GACTGTGTGG CCTCCAGGCA GGATGAGGAC TAA 
SEQ ID NO: 38 
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FIGOSB 34A 



Met 


Gly 


aXo Axg 


Ala 
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Val 
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Ser 


Glv 


Gly 


Glu Leu Asp Lys Trp 


Glu 


Lys 


lie 


Arg 


ijcu ax y 


Pro Gly Gly Lys 
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Lvs 
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Lys Leu Lys His He 


Val 


Trp 


Ala 


Ser 


AX^ I9XU 


Leu 


Glu 


Arg 


Phe 


Ala 


Val 


Asn 


Pro Gly Leu Leu Glu 


Thr 


Ser 


Glu Gly 


\jys Axg 


Gin 


He Leu Gly 


Gin 


Leu 


Gin 


Pro Ser Leu Gin Thr 


Gly 


Ser 


Glu 


Glu 


Xieu Axg 


Ser 


Leu 


Tyr 


Asn 


Thr 


Val 


Ala 


Thr Leu Tyr Cys Val 


His 


Gin 


T.I re 

jjys 


Tl ^ 


Asp vax 


Lys Asp Thr Lys 


Glu 

vx w 


Ala 


Leu 


Glu Lys He Glu Glu 


Glu 


Gin 


Asn 


Liys 


oex Xiys 


T.VQ 


Lys 


Ala 


Gin 


Gin 


Ala 


Ala 


Ala Gly Thr Gly Asn 


Ser 


Ser 




vajL 


oex uxn 


Asn 


Tyr 


Pro 


He 


Val 


Gin 


Asn 


Leu Gin Gly Gin Met 


Val 


His- 




/VXCl 


xxe OCX 


Pro 


Arg 


Thr 


Leu 


Asn 


Ala 


Tro 


Val Lys Val Val Glu 


Glu 


Lys 


AJLo 




Ser Pro 


Glu 


Val 


He 


Pro 


Met 


Phe 


Ser 


Ala Leu Ser Glu Gly 


Ala 


Thr 






AB^ XfCU 


Asn 


Thr 


Met 


Leu 


Asn 


Thr 


Val 


Gly Gly His Gin Ala 

• 


Ala 


Met 






T>OD TiVB 
Xi\?U XJ jr a 


Glu 


Thr 


lie 


Asn 


Glu 


Glu 


Ala 


Ala Glu Trp Asp Arg 


Leu 


His 


iriO 


vox 


His Ala 

nxo xijaCi 


Gly 


Pro 


He 


Ala 


Pro 


Gly 


Gin 


Met Arg Glu Pro Arg 


Gly 


Ser 




Tl 0 


Al a Glv 
^\x a vx jr 


Thr 


Thr 


Ser 


Thr 


Leu 


Gin 


Glu 


Gin He Gly Trp Met 


Thr 


Asn 


ASH 




P*ro X le 


Pro 


Val 


Gly Glu 


He 


Tyr 


Lys 


Arg Trp He He Leu 


Gly 


Leu 


ASXl 


ijys 


Tie Val 


Arg 


Met 


Tyr 


Ser 


Pro 


Thr 


Ser 


He Leu Asp He Arg 


Gin 


Gly 






fil 11 

V3X U * i W 


Phe 


Arg 


Asp 


Tyr 


Val 


Asp 


Arg 


Phe Tyr Lys Thr Leu 


Arg 


Ala 




Cln 


Al a <?9r- 

^IXGl 0CX 


Gin 


Glu 


Val 


Lys 


Asn 


Trp 


Met 


Thr Glu Thr Leu Leu 


Val 


Gin 




2il a 


Asn Pro 


Asp 


Cys 


Lys 


Thr 


He 


Leu 


Lys 


Ala Leu Gly Pro Ala 


Ala 


Thr 


Jjcu 


nl 11 


Glu Met 


Met 


Thr Ala Cys Gin Gly Val Gly Gly Pro Gly His 


Lys 


Ala 


TV ^^^T 

Arg 


V aX 


Leu Ala 


Glu 


Ala 


Met 


Ser 


Gin 


Val 


Thr 


Asn Ser Ala Thr He 


Met 


Met 




AX 9 


Gly Asn 


Phe 


Arg 


Asn 


Gin 


Arg 


Lys 


Thr 


Val Lys Cys Phe Asn 


Cys 


Gly 


Lys 


vax 


Gly His 


He Ala Lys Asn Cys Arg Ala Pro Arg Lys Lys Gly 


Cys 


Trp 


iiys 


L.ys 


Gly Lys 


Glu 


Gly 


His 


Gin 


Met 


Lys 


Asp 


Cys Asn Glu Arg Gin 


Ala 


Asn 


riie 


xieu 


Gly Lys 


He 


Trp 


Pro 


Ser 


His 


Lys 


Gly Arg Pro Gly Asn Phe 


Leu 


Gin 


C 


AX y 


Pro Glu 


Pro 


Thr 


Ala 


Pro 


Pro 


Glu 


Glu 


Ser Phe Arg Phe Gly 


Glu 


Glu 


jjys 


rnVt w> 
i. IIX 


Thr Pro 


Ser 


Gin 


Lys 


Gin 


Glu 


Pro 


He 


Asp Lys Glu Leu Tyr 


Pro 


Leu 






Leu Arg Ser Leu Phe Gly Asn 


Asp 


Pro 


Ser Ser Gin Met Ala 


Pro 


He 


Ser 


Pro 


He Glu 


Thr 


Val 


Pro 


Val 


Lys 


Leu 


Lys 


Pro Gly Met Asp Gly 


Pro 


Lys 


Val 


Lys 


Gin Trp 


Pro 


Leu 


Thr 


Glu 


Glu 


Lys 


He 


Lys Ala Leu Val Glu 


He 


Cys 


Thr 


Glu 


Met Glu 


Lys 


Glu Gly 


Lys 


He 


Ser 


Lys 


He Gly Pro Glu Asn 


Pro 


Tyr 


Asn 


Thr 


Pro Val 


Phe 


Ala 


He 


Lys 


Lys 


Lys 


Asp 


Ser Thr Lys Trp Arg 


Lys 


Leu 


Val 


Asp 


Phe Arg Glu Leu Asn 


Lys 


Arg 


Thr 


Gin 


Asp Phe Trp Glu Val 


Gin 


Leu 


Gly 


lie 


Pro His 


Pro 


Ala 


Gly 


Leu 


Lys 
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FIGURE 34B 

Asp Lys Trp Thr Val Gin Pro lie Val Leu Pro Glu Lys Asp Ser Trp Thr Val 
Asn Asp lie Gin Lys Leu Val Gly Lys Leu Asn Trp Ala Ser Gin lie Tyr Pro 
Gly lie Lys Val Arg Gin Leu Cys Lys Leu Leu Arg Gly Thr Lys Ala Leu Thr 
Glu Val lie Pro Leu Thr Glu Glu Ala Glu Leu Glu Leu Ala Glu Asn Arg Glu 
He Leu Lys Glu Pro Val His Gly Val Tyr Tyr Asp Pro Ser Lys Asp Leu He 
Ala Glu He Gin Lys Gin Gly Gin Gly Gin Trp Thr Tyr Gin He Tyr Gin Glu 
Pro Phe Lys Asn Leu Lys Thr Gly Lys Tyr Ala Arg Met Arg Gly Ala His Thr 
Asn Asp Val Lys Gin Leu Thr Glu Ala Val Gin Lys He Thr Thr Glu Ser He 
Val He Trp Gly Lys Thr Pro Lys Phe Lys Leu Pro He Gin Lys Glu Thr Trp 
Glu Thr Trp Trp Thr Glu Tyr Trp Gin Ala Thr Trp He Pro Glu Trp Glu Phe 
Val Asn Thr Pro Pro Leu Val Lys Leu Trp Tyr Gin Leu Glu Lys Glu Pro He 
Val Gly Ala Glu Thr Phe Tyr Val Ala Gly Ala Ala Asn Arg Glu Thr Lys Leu 
Gly Lys Ala Gly Tyr Val Thr Asn Arg Gly Arg Gin Lys Val Val Thr Leu Thr 
Asp Thr Thr Asn Gin Lys Thr Ala Leu Gin Ala He Tyr Leu Ala Leu Gin Asp 
Ser Gly Leu Glu Val Asn He Val Thr Ala Ser Gin Tyr Ala Leu Gly He He 
Gin Ala Gin Pro Asp Gin Ser Glu Ser Glu Leu Val Asn Gin He He Glu Gin 
Leu He Lys Lys Glu Lys Val Tyr Leu Ala Trp Val Pro Ala His Lys Gly He 
Gly Gly Asn Glu Gin Val Asp Lys Leu Val Ser Ala Gly He Arg Lys Val Leu 
Phe Leu Asp Gly He Asp Lys Ala Gin Asp Glu His Glu Lys Tyr His Ser Asn 
Trp Arg Ala Met Ala Ser Asp Phe Asn Leu Pro Pro Val Val Ala Lys Glu He 
Val Ala Ser Cys Asp Lys Cys Gin Leu Lys Gly Glu Ala Met His Gly Gin Val 
Asp Cys Ser Pro Gly He Trp Gin Leu Ala Cys Thr His Leu Glu Gly Lys Val 
He Leu Val Ala Val His Val Ala Ser Gly Tyr He Glu Ala Glu Val He Pro 
Ala Glu Thr Gly Gin Glu Thr Ala Tyr Phe Leu Leu Lys Leu Ala Gly Arg Trp 
Pro Val Lys Thr He His Thr Ala Asn Gly Ser Asn Phe' Thr Gly Ala Thr Val 
Arg Ala Ala Cys Trp Trp Ala Gly He Lys Gin Glu Phe Gly He Pro Tyr Asn 
Pro Gin Ser Gin Gly Val Val Ala Ser Met Asn Lys Glu I^eu Lys Lys He He 
Gly Gin Val Arg Asp Gin Ala Glu His Leu Lys Thr Ala Val Gin Met Ala Val 
Phe He His Asn Phe Lys Arg Lys Gly Gly vile Gly Gly Tyr Ser Ala Gly Glu 
Arg He Val Asp He He Ala Thr Asp He Gin Thr Lys Glu Leu Gin Lys Gin 
He Thr Lys He Gin Asn Phe Arg Val Tyr Tyr Arg Asp Ser Arg Asn Pro Leu 
Trp Lys Gly Pro Ala Lys Leu Leu Trp Lys Gly Glu Gly Ala Val Val He Gin 
Asp Asn Ser Asp He Lys Val Val Pro Arg Arg Lys Ala Lys He He Arg Asp 
Tyr Gly Lys Gin Met Ala Gly Asp Asp Cys Val Ala Ser Arg Gin Asp Glu Asp 
SEQ ID NO: 39 
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